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A Unified View of Synchronous Data 


Transmission System Design 


By JAMES W. SMITH 
(Manuscript received October 5, 1967) 


This paper illustrates the basic equivalence of many of the linear data 
transmission design techniques. It shows the untfying feature of these 
techniques to be a generalization of Nyquist’s original ideas relating time 
samples and frequency domain constraints. It examines pulse amplitude 
(with and without constraints on the input data) and pulse shape modula- 
tion systems, and shows their relationships. It uses a number of previously- 
described systems to illustrate the range of possibilities of the very general 
design approach. This paper presents some new results on noise and channel 
parameter monitoring and on spectrum shifting by constraining the input 
data. 


I. INTRODUCTION 


Over the years, many seemingly different techniques have been 
proposed for synchronous data transmission. Unfortunately, the litera- 
ture devoted to these techniques tends to expand the differences be- 
tween a specific system and all other systems. It is our purpose to 
show the basic equivalence of the various techniques; hence, to show 
a unified view of the field. In doing this we examine some well known 
and some relatively unknown transmission systems in a new light 
and propose some new techniques. 

The unified design view that we take here is basically a generaliza- 
tion of Nyquist’s ideas which have recently been expanded upon by 
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Gibby and Smith.? This really is the thread which ties together vir- 
tually all of the literature on synchronous data transmission. Section 
II summarizes the basic ideas of these two papers. 

Section III describes the model of the general, linear, data transmis- 
sion system to be considered. The system uses M channels and is de- 
scribed by means of an input data vector rather than by any state- 
ments about the transmitter characteristics. Thus, one type of data 
vector implies a pulse amplitude modulated system (PAM) while an- 
other type of data vector implies pulse shape modulation (PSM) such 
as frequency shift keying (fsk) or pulse position modulation (PPM.) 

In Section IV, we use Nyquist’s approach to find the design con- 
straints for distortionless transmission (no intersymbol or interchannel 
interference). This section shows that the conditions for distortionless 
transmission depend upon the input data vector description; hence, 
different design constraints result from PAM or PSM transmission. 

Section V illustrates the application of the constraints to some 
special cases. These include: 


({) Linear precoding and decoding for PAM transmission. 

(it) The use of band-limited orthogonal signals for multichannel 
PAM transmission. 

(21) Noise monitoring in PAM systems. 

(tv) Binary PSM transmission (including the specific case of 
Sunde’s FM model with a linear receiver instead of a phase derivative 
receiver). 

(v) Parameter monitoring in PSM systems. 

(vz) Zero stuffing techniques for shifting spectrum location. (The 
section shows this to have some promise for voice channel transmis- 
sion without carrier modulation.) 


Il. THE UNIFYING VIEW 


In designing a data system, one usually starts with a desired time 
response for the total system. Because it is only necessary to examine 
the output signal at fixed times (for example, at 7’ second inter- 
vals where 1/T is the rate at which independent symbols are being 
transmitted), one needs to specify the over-all response at those times 
(for example, t = kT, all k). Since the total response of the transmit- 
ting filter, the channel, and the receiving filter is easier to determine 
in the frequency domain than in the time domain, one must relate the 
time response constraints to frequency domain constraints. 
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This briefly is the basic problem attacked by Nyquist. Here is a 
summary of his results, as expanded by Gibby and Smith. 
Any time function r(t) with Fourier transform R (wo) 


WS _ [ _Riel** de (1) 
has sample values at multiples of 7 seconds which may be written 
rqT) =r, = - [ R()e’?*” dw (2a) 
(Q2n+1)4r/T ; 
San | Rel? de. (2b) 
2m pene J (an—1) 2/7 
Or, changing the variable, 
7 => ss = Rw + 2nn/Tei**? du. (26) 
Assuming that 
>> Ru + 2nxr/T) 


is a uniformly convergent series, one obtains 


-if" >> Riu + 2ne/Te"*? du. (24) 


t/T n=-% 
Noting that r, is sist the g coefficient of an exponential Fourier series 
expansion of 


>} Rw + 2nr/T) — —x/T Su So/T, 


one obtains 


= > Riu + 2nn/T) = 3 rei" —r/TSusxr/T (3) 
which is very closely related to the Poisson sum formula.’ Throughout 
this paper the reader should keep in mind the interval —7r/T S u S 
aw/T; we will not be repeating it explicitly. 

Equation 3 relates a function of the frequency domain characteristic 
(namely the sum of the values at frequency intervals 27/7) to the 
time response constraints r, which will be chosen for a particular trans- 
mission scheme. Fig. 1 illustrates several frequency characteristics which 
satisfy the time response constraint that r, = 0 forg ¥ 0. When7, ~ 0, 
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Fig. 1— Such constraints on R(w) that r(q7) = 0, ¢ 40. 
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we have the usual design for PAM transmission without intersymbol 
interference. When ry = 0, we have a response which is useful when 
crosstalk between channels is to be eliminated. That is, the output 
of a signaling path whose time response has r, = 0 for all g contains no 
information about the input data at the sampling times. For further 
discussion of equation 3 see Ref. 2. 


III. THE GENERALIZED TRANSMISSION MODEL 


The optimum theoretical method (in the sense of minimizing error 
probability) for transmitting data through a Gaussian channel con- 
sists of waiting until all data have been accumulated at the transmit- 
ter and then sending a single waveform to represent the entire mes- 
sage. The optimum receiver (in the presence of white noise) consists 
of filters matched to each message waveform. The disadvantage of 
this form of communication lies in the fact that transmitter and re- 
ceiver complexity grows exponentially with message length. 

Thus, system designers usually restrict system complexity by not 
waiting for the entire message before transmitting. Short portions of 
the message can be encoded systematically and transmitted sequen- 
tially as they arrive using relatively simple terminal equipment. 

Fig. 2 illustrates the general approach to transmission system de- 
sign considered in this paper. The input data samples, an,, m = 1, 2, 

. , M are applied at t = kT to the M signal generators A,,(o). 
The sequence {a,,;} can be considered to be a random sequence of 
impulses of weight @,,, (where a»; 18 in general multilevel) and spaced 
T seconds apart. Since there are M signal generators, symbols are be- 


g, (t) 


ee) M 
ey Dd Aamk Pm (t-kT) 
k=-00 m=! =Qo(t) 





Fig. 2— General transmission system. 
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ing sent at the rate M/T symbols per second. The receiver consists of 
M linear filters, C,(o), p = 1,2,...,M. 

The channel input represents the sum of the transmitter outputs 
and may be written 


M ie) 


DS De Gitalt —- kL) 


m=1 k=—0 


where 


nae x: i A, (ael* de. 


Then, using 


Pnp(t) = oe / An(o)Blo)C,(w)e'*! deo (4) 
the output of the p (p = 1,2,..., M) filter may be written 
M i) 
g() = a DS Anil mp(t — kT). (5a) 


m=1 k=-0 
It will be assumed that any decisions will be made on the basis of the 
output waveform at integral multiples of 7 seconds. These output sam- 
ples at the time f=IT, 


AM ce) 


Jot = gr(lT) = 2 p> Gail nl T a kT) (5b) 
m=1 k=—-00 
may also be written in vector notation as 


Int = > TUT 3 kT) Gy (5c) 
k 


=—00 


where 


7,17 — kT) = [r,,(IT — kT), 12,(IT — kT), «++ ,ry,(IT — kT)] (6) 


and 


a ip (7) 


Ok 


The model described, then, represents a general pulse amplitude 
modulation system. For example, if the elements of @, are random 
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multilevel values and the transmitter and receiver are systematically 
chosen to be 


A,(w) = Ao(w — en) + Ao(—@ — &m) (8a) 
C,(w) = Cow — wn) + Col(—w — wm) (8b) 


one has a frequency division multiplex PAM system. Likewise, choos- 
ing . 


A,,(w) = A(w) exp | - jm Det | (9a) 


Ca(0) = Cl) exp | j= Mor | (9b) 


leads to a time division multiplex PAM system. 

For the model to be as general as possible, it should include the 
possibility of using multiple waveshapes to convey information. This 
can be accomplished by restricting the k data word to be 


1] fol fo 0 
o| }1]} Jo 0 

d@,=|0},/0],/1}--- or [0}- (10) 
o/ Lo! Lo 1 


Thus, the system transmits one of M possible waveforms in each time 
slot and includes such modulation techniques as FM, PM, pulse posi- 
tion or pulse duration modulation. 


IV. DESIGN CRITERIA FOR THE GENERALIZED MODEL 


The general design constraints for the two different interpretations 
of @, are imposed by the requirement of distortionless transmission 
(that is, no intersymbol or interchannel interference). Each of the two 
cases leads to a different definition of distortionless transmission and 
hence to different design constraints. 


4.1 Pulse Amplitude Modulation 


(Elements of @, are random and multilevel.) 
The output of the p™ receiver filter at t = IT is 


Jot = >» 7 c- kT) -a, . (5c) 


=O 
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For distortionless transmission it is required that this output depend 
only upon the input value a,:; that is, 
Int = Aik, (11) 
where K, is a constant that depends on the p™ channel 
A, (w) B(w)C,(w). This requirement constrains the time response* 
fee (AS = kT) = So mp 61,0 > (12) 


where 


bmp = 0 m#zD 
= 1 m =p. 


Using equations 3, 4, and 12, the time domain constraint becomes 
the frequency domain constraint 


LS ( 2nny( 2am a, ( an) 
pou, An w+ Blu + a JColu + pr) = bmn Kp 13) 
This is a generalized Nyquist criterion which applies to all linear 
PAM systems. 

Notice that equation 13 represents M? equations which must be 
satisfied by 


MT/z X positive frequency range of nonzero B (w) 


independent variables. Therefore, B(w) must have a radian band- 
width of at least Mz/T for M channel distortionless transmission. 


4.2 Pulse Shape Modulation 
(a, given by equation 10.) 


The definition of distortionless transmission of the previous part 
(equation 11) could also be applied here. However, it is possible to use 
a different definition with quite interesting results because of the con- 
straint upon @;, . Here, distortionless transmission will require that the 
output of the p filter at t = IT be 


Int = OK ae Ky. (14) 


where K,; and Kye are constants. That is, the output of the p™ re- 
ceiver takes on one of two values, at ¢ = IT, Ky, + Kye or Kye depend- 
ing upon the value of dp. 

This definition of distortionless transmission eases the constraints 
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upon the various time responses. Examining 


Gn = Dd, F(T — kT) -G, (5e) 
k=-@ 

it is seen that all elements of 7,(/7 — kT) must be identical but not 
necessarily zero for k # 1; that is, 


Tnpll — kT) = 7,,(00F — kT) all m, ¢. (15) 


With this condition satisfied, g,(l17) is independent of a, for k # 1 
(that is, the information transmitted at times other than /T). Next it 
is required that all elements of 7,(0) be identical except 7,,(0) (that is, 
Tmp(O) = 1ep(0) m, g ¥ p). Thus, g,(/7) will take on one value if a,, = 1 
and a different value if any other a,, = 1 q ¥ p. These statements may 
be summarized by the equation 


mp UT = kT) = F otk + One 51,Gp (16) 


where F,,, ;_; and G, are constants which depend only on the subscripts 
and are independent of m. 

Using (3), (4), and (16), the time domain constraints become the 
frequency domain constraints 


1< Qnr Qn Qnr 
fo Anu a 27 au 7 =n Na,(u + 2) 


NA—O 


= binp G, + oy Fico (17a) 
k=- 0 


l 
= mp Gp + Fu) (17b) 


recognizing that the last term is really a Fourier series expansion. 
Notice that there is a good bit of freedom in the design because F, (wu) 
can be chosen arbitrarily. Alternatively, this means that the time 
domain response samples, F’,, ;_;,, can be arbitrarily chosen but, these 
samples must be the same for the response to each transmitter. Thus, 
because the input data has been restricted, the definition of distortion- 
less transmission can be relaxed. 


V. DESIGN CRITERIA APPLICATIONS 


Let us apply the general design criteria derived in the previous sec- 
tion to some special cases to illustrate the principles involved. These 
examples include PAM, PSM, and systems in which the data vector is 
partly independent multilevel and partly constrained (that is, where 
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some of the components of the data vector are unconstrained and the 
rest are forced to be zero). The examples clearly bring out the rela- 
tionship between transmitting information with amplitude or wave- 
form variation. 


5.1 Pulse Amplitude Modulation Systems 


5.1.1 Linear Precoding and Decoding 

Pierce® has suggested the use of linear precoding and decoding ma- 
trices for data systems to improve performance in the presence of im- 
pulse noise. Fig. 3 shows a system using this concept. It differs from 
normal smear-desmear techniques in that there are M channels instead 
of just one (that is, the input data are block-encoded). 

The customer data, now labeled e,,,n = 1,...,N, are applied to 
the precoder at t = kT. The transformed data a»; are then applied to 
the input of the m* transmitter. In terms of the input data, one has 


ax = Pox (18) 
where P is the N by M (M = N) precoder matrix and the input data is 


a, = | |. (19) 


ONk, 


Vi oe D9, (20) 


LINEAR 


TRANS- 
FORM- 
ATION 





Fig. 3— Transmission system using linear precoding and decoding trans- 
formations. 
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where D is the decoding matrix and 


are lion (21) 


and 


I 


eek (22) 


J Mk 


It may be noted that the transmitted signal may be written 


M e) 


>> >» Antbn(t — kT) 


m=1 k=-0 


or in terms of the input signal 


ML N ve) 
DS oy pd PrjOtj1An(t — kT) 


m=1 j7=1 k=—00 


or 
ce) N Af 
>, >» Qik >; PrgAn(t 7 kT) 
k=~-0o j=l m=1 
or 
C-) N 
DD apnai(t — kT) 
k=-w© j=1 
where 
M 
at(t) = DY Prian(l). (23) 
m=1 


Thus, using a linear coder merely corresponds mathematically to us- 
ing a different set of signal generators with no coder. It might be de- 
sirable in some cases to treat the precoder separately,® because it could 
be an easily modified device (that is, one consisting only of gain or 
delay variables) which could be used to combat noise, change the data 
rate or shift the spectrum of the signals on the channel. 
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As an example, consider the two precoding matrices 


1000 0 0 1 00 0 0 
0100 0 0 00010 0 
Poe 8! ae 0 pee ob 000 
00010 0 00001 0 
00001 0 0010 0 0 
00000 1 00000 1 


used with time division multiplex transmitters (that is, serial trans- 
mission 


A,@) = Aw) exp | -i in wet |). 


Matrix P, corresponds to no precoding while matrix P, represents 
interleaving which might be effective in combating burst noise if the 
input data is redundant (that is, digitally encoded into blocks of length 
3, in this case. Notice that interleaving is basically a digital technique 
for error control in burst noise. This is amply illustrated by the presence 
of identical values as the single nonzero element in each row and column 
of the matrix. For analog error control (smearing or spreading the 
information over several symbols) in burst noise, the elements of P 
can be any real values. 

The choice of a particular precoding matrix would presumably be 
based upon some knowledge of the noise characteristics. The decoder 
can then be designed for distortionless transmission by solving DP = Iyy 
if it is assumed that 


i 2 2 2 ; 
wae An(u + Cw f rou + 20) =5,,K,. (13) 


n=—oO 


Similarly, D could be obtained by considering the transformed trans- 
mitter and receiver and solving 


LS aly 4 2 2nm\ 2am — 5. Ke 
7 2. Atl + on p(w +p Jer(u +} = by Ki. 24) 


The transformed receiver C4(w) is just 


Ci) = > DjinC p(w). (25) 


The two approaches are equivalent. 
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5.1.2 A Two-Channel PAM System 

Fig. 4 shows a two-channel PAM system. All of the main features 
of the design constraints can be easily shown by means of this ex- 
ample. The four equations which must be satisfied are 


Ale + aE) au + tou + pF) = Bae 


m,p = 1,2. (13) 


It is apparent that A,,[u + (2nr)/T] and C,[w + (2nr)/T] must 
have nonzero values for at least two values of n (two intervals of 7/T 
bandwidth or two intervals of width 2x/T when both positive and 
negative frequencies are considered). Hence, the total bandwidth must 
be 2x/T for distortionless transmission. If the bandwidth is greater 
than 27/7, an infinite number of designs are possible. 

A clearer idea of the implications of equation 13 can be gained by 
examining the impulse responses in the time domain which are shown 
in Fig. 5. Notice that r3,(t) and 722(t) are the usual pulses required for 
data transmission. The crosstalk waveforms 7r3.(t) and 71e1(t) are re- 
quired to be zero at all t = kT so that the output of either channel at 
t = kT does not depend upon the input to the other channel. This does 
not mean, however, that there can be no frequency overlap between 
the transmitter of one channel and the receiver of the other. It does 
mean that the characteristics must be chosen so that 


LS Aa(u + *)a(u + #)c,(u + 22) = 0 mAzAp. (26) 


Fig. 1 shows one such characteristic and Fig. 6 gives one possible 
design for the two-channel system which anticipates the next example. 

Notice that if As(w) were zero (that is, a single channel system), 
the second receiver could be used for a noise monitor.’ By taking the 


Bik Ty (t-kT). 
+ 54 To, (t-kT) 


Aik M2 (t-kT) 





Fig. 4— Two-channel PAM system. 
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Fig. 5— Required time domain responses for distortionless transmission. 


output of this filter at kT’ seconds (when the noiseless component is 
zero), squaring, and averaging, one can get an estimate of the variance 
of the channel noise. 


5.1.3 Band-Limited Orthogonal Signals for Multichannel Transmission® 


Thus far, we have discussed only general design constraints with- 
out regard for the specific choices that a designer must make if he 
has available more than the minimum bandwidth (as he must). In 
other words, if only the minimum bandwidth were available, the de- 
signer would have no choice but to match the M? equations with the 
M? variables (a slight choice does arise between serial and parallel 
formats). However, arbitrary choices can be made when one has more 
than M? variables (bandwidth > Mzx/T). 

Chang® has considered one such possibility; namely, a frequency 
division multiplex system in which signals at the channel output, 
Am(w)B(w), are orthogonal. In other words, taking B(w) = 1 for 
notational simplicity, Chang’s signals are chosen to satisfy the time 
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domain requirement 
[ : a(i)a,(t — KP) dt = 8, 80 Kp - (27) 
In the frequency domain this requirement becomes 
jaf AnlaDAS ae deo = bap bro Ky (28a) 


Using the technique of equations 1 through 2d, we obtain 


x/T 


> An(u + 28) as(y el? ss du = 8np by K, (28b) 


us 
20 —-nr/T n=—-0 


Ai(@)} —> 


7 


Az(wjet2 —> Ci(w) —> 
JA2@)Ci(w)—»>  Azlw)Co(w)— >  A;(w)Ci(w) —> 


Ca2(w) edd —> 
— jA,(w) Co (w) —> 


) alT anlT 


° 


a{T 2a/lT 


Fig. 6 — A possible two-channel system assuming B(w) = 1. 
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or 


LS alu + 2 )at(u + 2) - 
ple An\u + -JAS\u + a) = one Ke (29a) 
for the frequency domain representation. This equation is identical 
to equation 13 if C,(w) = A*(w) (which is best in the presence of white 
noise) assuming B(w) = 1. [Nonideal B(w) can be considered by as- 
suming that A,,(w) is the channel output rather than transmitter output]. 
Thus, it is seen that the requirement of orthogonality is a special case 
of the general design criteria with the additional constraint that C,(w) = 
A*(w), 

In addition to this constraint, Chang chose a frequency division 
multiplex format with overlapping signal spectra such that 


| A,(w) | ¥ 0 


only for 


(n—D)e<lel<(m ta) 

One can insert these assumptions into equation 29a and arrive at 
at the design conditions. It is, however, more enlightening to examine 
the system in the light of the previous discussion of a two-channel 
system. Fig. 7a shows the spectra of the three transmitters which af- 
fect the output of the mt» channel under the assumptions outlined 
above. (For concreteness of the discussion, m is even; odd m would 
proceed similarly.) No intersymbol interference in the m* channel 
requires 


1 ¥ 2nm +( 2n z 
p An(u + as uta) = Kn. (29b) 
In other words, the characteristic |A,,() |? must have vestigial sym- 
metry about o = mx/T and (m — 1)x/T. 

Let us turn now to the crosstalk terms. The equations which must 
be satisfied for distortionless transmission are 


and 
SS Ansa(u + 2) aa(u + 228) = 0. (30b) 


n=—0 
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[Am-1@)| [An(w)| lAm+1(w) 
(a) 
(m-8)F (m-Z)E  (m-E)E (meg) FE (m4 8) F 
o-_ 


[Am-1(@)A nr w)| 






we ODE ODF 


Im Am-(w) A*(w) 


(Cc) 










[oe) 
D. Am-1(Ut2na/T)Ay (u+2nm/T) 
/{n=-00 


Fig. 7— (a) Spectra of transmitters which affect m‘® output. (b) Magnitude 
of Ama(w)Am*(w). (c) Required Am+(w)Am*(w). (d) Demonstration that above 
Am-s1(w)Am*(w) satisfies constraint for periodic zeros. 


Fig. 7b shows the magnitude of A,,-,(w)A*(w) which is symmetric 
about |w| = (m — 1)x/T. The only way these components can sum 
to zero following equation 30a is if they are imaginary as shown in Fig. 
7c (with the sum given in Fig. 7d). The same argument applies to the 
An+1(w)A*(w) product and is illustrated in Fig. 8. In other words, 
A*(w) must be +90 degrees out of phase with A,,,,(w) and A,n_,(w) 
in the regions of overlap of the functions. 

It is seen that the amplitude characteristic design is based upon the 
condition of no intersymbol interference in each channel and is based 
upon the usual Nyquist design. The remaining freedom in choosing 
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(a) lAm+1@)Am@) | 





(b) Im Ams (@)A,*(w) 





[ee] 
> Ame (Ut2na/T) Am (u+2na/T) 


| hs : 
fli 


-1/T 


| | 


ena—_ 


Fig. 8— (a) Magnitude of, and (b) required Amsa(w)Am*(w). (c) Demonstra- 
tion that above Ams(w)Am*(w) satisfies constraint for periodic zeros. 


the phase characteristic is then used to eliminate interchannel inter- 
ference with the requirement being 


phase of A,,(w) = phase of A,,_,(w) +90°. (31) 


5.1.4 Noise Monitoring’ 


The noise monitoring feature mentioned previously can be gen- 
eralized to the M channel case. The minimum bandwidth of Mx/T 
must be exceeded by the practical system. The bandwidth redundancy 
can be used for noise monitoring by adding another filter Cyr4.1(w) at 
the receiver. This receiver must satisfy the equations 


bE ale ale eae + 


Nn=—- oO 


0 


I 


for lsmsM (32) 


and a nontrivial solution can result because of the bandwidth redun- 
dancy. Then, the impulse responses fmar41(t) go through zero at all ¢ 
= kT and the noiseless output of the M+1* filter 
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AM ) 


»» p> Omi ma ailt — kT) 
is zero periodically, independent of the input data. 

The filter output at t = IT can be squared and averaged to obtain 
an estimate of the noise level and hence an estimate of the transmis- 
sion performance. If the shape of the noise power spectrum is known, 
one gets a quantitative estimate of the noise power. Timing errors or 
poor knowledge of B(w) can lead to the noiseless output of the M@+1™ 
filter being nonzero at the sample time. The indicated noise variance 
would be greater than the correct value, thus indicating poorer per- 
formance than the noise alone. However, timing or channel characteri- 
zation errors actually do lead to poor system performance so that the 
monitor indication is in the right direction. Notice that this monitoring 
scheme is not tied to any particular choice of transmitter or receiver 
and is perfectly general. 


5.2 Pulse Shape Modulation Systems 


5.2.1 A Binary PSM System 


Insight into pulse shape modulation system design constraints can, 
perhaps, best be gained by examining a binary system such as that 
shown in Fig. 9a. The equations that must be satisfied are 


} Oe 2 2 2 
#3 Aa(u + 222) a(u + BE}o,(u + 2B) = bun G, + Fol) 


(17b) 


for m, p = 1, 2. Because it is a binary system, the receiver can be 
just a single filter 


Cw) = C2) — Ci) (33) 


Qn 2n3r 2n3r 


pe Alu + naw + nay + aa = —-G-+F(u) (84a) 


where it has been assumed without loss of generality that 


G=G,=G, 
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Fig. 9 — (a) Binary PSM system. (b) Modified binary PSM system. (c) Equiv- 
alent binary PSM system or PAM system with data constraints. 


and where F (w) is an arbitrary function of frequency 


Pau) = Fee) — Fw) = >> fe™”. (35) 


This modified system described above is shown in Fig. 9b. 

Fig. 10 shows two possible time domain responses which satisfy 
equations 34a and b. Notice that the responses differ only at t = 0 
and are identical to all other t = kT. This is the time domain implica- 
tion of equations 34a and b. This corresponds to the case where two 
signals are chosen to produce the same intersymbol interference which 
was discussed briefly by Simon and Kurz.° 

An alternate way of viewing equations 34a and b is to notice that 
the two transmitters can each be decomposed into two components. 
The information component A;(w) of each satisfies 


= 2 2 
7 Ade + lle + oes) =e Go 


na=—-O 


SYNCHRONOUS DATA TRANSMISSION 293 


or the usual Nyquist criterion, and is transmitted with an amplitude 
of +1. The steady component A, (wo) satisfies 


Ly Adu + nr a(u +t o(u + Zn =F) (87) 
s ie Soere T 1 1 
and is sent with an amplitude of one regardless of the data stream. 
The transmitted waveform A,;(w) can be anything because F(w) is 
arbitrary. Fig. 9c shows this system, which is equivalent to the original. 
The corresponding data vectors are 


ae oy ok "| (38) 
1 ] 


The basic equivalence of the PAM and PSM systems is thus made ex- 
plicit. The difference in the two systems is basically a noninforma- 
tion bearing signal which represents an inefficient (theoretically) use 
of power. This point has been brought out by Bennett and Davey’ in 
discussing the Sunde! model of a synchronous FM system. 


5.2.2 Sunde’s FM Model With a Linear Receiver 


In Sunde’s'! model of a synchronous FM system, one of two phase 
continuous signals 


a,(t) = cos =H4 i+ 6 


ae 
a(t) = — eos emg + DE +96 

T 
a,(t) = a(t) = 0 elsewhere 

















aT “ests al “et fe) T 2T 3T 47 


Fig. 10— Possible time domain responses for distortionless PSM system. 
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is sent during each interval. The transmitter output may be written 


0 


> ay, cos (0,2 =e a) — (1 — a) cos (0.0 Pack a) 
k 


=—0 


or 
sin tsin (wt + 0) + >> (2a, — 1) C08 t cos (w,t + 8) 
k=-0 
where 
5 e CLE De, 
T 


This second form of the output is an explicit example of an informa- 
tion-bearing component (second term where ay, = 1, 0) and a steady 
state component. 

To achieve distortionless transmission (with a linear receiver) one 
must choose any linear filter which satisfies 


i< 2 2 2 
1S Aut 2)a(u + *)c(u+ Bt) = @ 


where 
T/2 rT ; 
Aw) = 7 COS t cos (w,t + oe '** dt (40a) 
—-T/2 
= e Sw — w,) +e’ S(w + @,) (40b) 
where 
2s) = Sea a EE 4 ere Ee) 


If one makes the assumption that S(w+o,) 1s negligible at positive 
frequencies, then 


— cog a edL lee | ia 
A,@) = cos 5 @-w) > e w > 0. (40d) 
By substituting equation 40d into 39 the requirements on B(w)C(o) 


can be found. The minimum bandwidth solutions are (neglecting con- 
stants) 


=o) a e/T 
cos (w — w,)T'/2 


wo +nr/T <wo<o,+(n4+1)c/T n= —-1,0 


= 0 elsewhere 


Be)Cw@) = 


(41) 
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which are shown in Fig. 11a. Solutions in other regions (other values 
of n) are possible but require infinite gain. 

Sunde’s solution for the minimum bandwidth filtering before a phase 
derivative (nonlinear) detector is given in Fig. 11b for comparison. 
Notice that the linear receiver requires only half the bandwidth re- 
quired by the phase derivative detector for distortionless transmis- 
sion. The response of the linear filter to the steady state term is un- 
important because it is deterministic and can be removed. 


5.2.3 Monitoring System Parameters 
If a second filter C,(w) is added to the system of Fig. 9c, one again 
can monitor some aspect of system performance if the equation 
LS 2n ( | 2nr ( Pam Z 
& OD Alu + 2 )B(u + 72 C\u + =) = 0 (42) 
is satisfied. Thus, the output at the sample times will be independent 


of the input data. However, there will be a constant output value (ex- 
cluding noise) of 


pe oe 
where 
= 2 
1 A(n + 2)a(u + N0,(u + 72) = Rly (43a) 
= 2) fue" (43b) 
—1.27 
|B(w)C(w)| |B@)C(w)| aus 
n=-1 n=0 
(e) pees Do Wot7/T 





|B(w)C (| 





Wo- T/T Wot 1/T 


Fig. 11— Minimum bandwidth filtering for FM system with (a) linear re- 
ceiver and (b) phase derivative receiver. 
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because of the steady state transmitter. The total output of this 
filter is then a (generally) nonzero constant which depends upon the 
steady state transmitter and channel characteristics and the noise. It 
does not depend upon the data sequence or the receiver’s estimate of 
the sequence. A change in this constant reflects a change in the trans- 
mitter or channel parameters (for example, phase or gain) and can be 
used to modify the receiver characteristics (such as, phase or threshold 
level). Thus, the noninformation part of the transmitted signal, in 
addition to perhaps simplifying implementation, also can be used to 
provide needed information to the receiver. A simple example is the 
reference tone for carrier recovery which in fact makes the PAM sys- 
tem into a PSM system. 


5.3 Pulse Amplitude Modulation System With Zero Constraints on Certain 
Channels 


In section 5.2 we showed that PSM could be considered as PAM 
with constraints on the input to certain channels. In other words, the 
equivalent PSM system shown in Fig. 9c contained one channel whose 
input was constrained to be a one at all times. Now we will discuss a 
system in which certain channel inputs are constrained to be zero. 

Consider a four-channel PAM system and assume it to be serial; 
that is, 


Ay = Ae | - ee (9a) 


Calo) = Clo) exp | j VF |. (9b) 


If the input data vector is given by 


Qik 

—) a 

a, = af (44) 
Asp 
Agr 


then a bandwidth of 47/7 is required. Now, under certain circum- 
stances it might be desirable to reduce this data rate by inserting 
zeroes for some of the a, (that is, not transmitting anything at cer- 
tain times). 
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Chang® has considered this possibility for improving performance 
in the presence of severe noise. In this case, some a; can be made 
zero and the remaining data can be transmitted with increased power 
(maintaining a fixed average power) to improve the noise margin. 

Another purpose of this zero stuffing technique might be to shift 
(as well as reduce the bandwidth of) the spectrum of the transmitted 
signals. As a trivial example, the data vector 


Qik 

aeeee (45) 
A131, 
0. 


could be transmitted with a flat spectrum over either the region 
O<|w| <2r/T or 2/7 < |w| < 40/T 


and zero elsewhere. As a nontrivial example, consider a generalization 
of a signaling system, invented by Bennett and Feldman,’ to prevent 
intersymbol and interchannel interference in multiplex transmission. 
The original system has been described very briefly by Sunde.*? Here, 
the generalized system can be approached by writing the data vector 


a, =| (46) 


0 


where only the outputs of the first two receivers must be examined. 
With the assumption (for simplicity) that B(w) = 1 the constraining 
equations become 


; Ne A(u + 2m oy + ane: th (47) 


=—00 


and 


lec 2 2 ‘ 2 ig 
T 2s, A(w + nol + an) exp +t i(u + 2a aoe 0. (48a) 
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Recognizing that exp (+ju 7/4) is a nonzero term which can be re- 
moved, 48a becomes 


ee A(w + 2a aly + 2a exp -& ics = 0. (48b) 

Tig. 12a illustrates the type of characteristic A(w)C'(w) which satisfies 
the constraints. (There are others of larger bandwidth which also will 
satisfy the constraints.) If the characteristic is limited to | w| < 47/T 
and zero elsewhere, it has symmetry about w = 27/T and vestigial 
symmetry about |w| = 7/T and 37/T. It can easily be verified that 
the equations are satisfied when one uses the value of the multiplying 
factor exp +jn7/2 which is shown in each region. 

Notice that a response which is flat from 7/T < |w| < 30/T and 
zero elsewhere satisfies the equations and represents the minimum 
bandwidth approach to this scheme. The time response one obtains 
at the receiver using this technique is illustrated in Fig. 12b. It is 
constrained to be zero at allt = kT except k = 0 and +(4q — 2) for 
all g and can be used for transmission, as explained, without distortion. 

The advantage of this particular scheme is that it represents a base- 
band technique for shifting the transmission spectrum without modu- 
lation merely by inserting zeros into the data stream. It appears 
particularly attractive for placement within a voice channel (for ex- 
ample, 200Hz-31KHz) as Fig. 13 shows. Here no modulation has been 
required, the energy is concentrated in the center of the band and 


exp + jO% = -| ¥J { ad a 


Pa rn > a 






A(w)C(w) 


-30/T —7/T fe) T/T 37/T 





Fig. 12— (a) Frequency characteristic, and (b) time response, for distortion- 
less transmission with zero stuffing. 
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A(w) B(w) C(w) 





fe) 1000 2000 3000 
FREQUENCY IN HERTZ 


Fig. 13 — Zero stuffing spectrum for voiceband transmission. 


one could obtain symbol rates of 3200 symbols per second with easily 
realized filtering. The primary disadvantage would be increased sen- 
sitivity to timing errors. 


VI. CONCLUDING COMMENTS 


The thesis of this paper has been that all linear data system de- 
signs are based on the modified Poisson sum formula 


poet) = Ene @) 
which relates the time domain samples to the frequency domain con- 
straints. Various types of systems which a designer may choose re- 
quire a variety of constraints on the time samples r,. These values 
of rz, which depend upon the type of system chosen, then specify the 
frequency domain requirements. 

Section 5 gave a sampling of the range of systems which can be 
designed using equation 3. That section certainly does not exhaust 
the possibilities and we hope that it does not limit the reader’s im- 
agination. Most of the examples (as well as most real systems) as- 
sume systematic choice of transmitted signals; usually related by 
integral time or frequency shift. There may, however, be potential 
gain in considering nonsystematic transmitters and receivers. This 
may easily be done using equation 3. The last case examined, that of 
spectrum shifting by adding zeros, is just such a nonsystematic func- 
tion when viewed from a serial transmission point of view. 
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Click Comparison of Digital and 
Matched Filter Receivers 


By H. L. SCHNEIDER 
(Manuscript received May 25, 1967) 


We applied the click theory of errors to determine the performance of a 
digital FM receiver. The receiver had binary orthogonal FSK modulation 
in a channel that had a single random-phase echo at the symbol duration. 
We use practical bandwidth assumptions to show that this error performance 
ts tdentical to that calculated for a matched filter receiver. Numerical results 
show, for example, that an increase in signal-to-noise ratio of 10 dB is 
required to maintain a 10~* error rate when an echo of half the signal power 
is added. 


I. INTRODUCTION 


The concept of clicks in an FM receiver was originally used by 
S. O. Rice? and J. Cohn? to explain the effect of noise on analog signal 
demodulation near threshold. Recently, several theoretical investiga- 
tions of digital FSK signal demodulation have applied the concept of 
clicks in analyses of low pass filter processing of the discriminator 
output. Klapper,*? and Mazo and Salz* modelled the low pass filter 
with an integrate-and-dump function, while Schilling, Hoffman, and 
Nelson considered a gaussian low pass filter.’ In all cases, additive 
gaussian noise was assumed to be the sole source of interference in 
the signal channel. 

In this paper, we consider intersymbol interference that is induced 
by delay dispersion in the signal channel. Analysis is limited to a 
practical single-echo channel* and binary orthogonal modulation. Al- 
though the analysis seems to be tractable for only special cases, we 
gain insight into the error mechanism of digital FM reception. 

The relation between clicks and errors is viewed as follows. Since the 
fundamental description of clicks concerns a random angular encircle- 


*The single-echo channel was used by Bennett, Curtis, and Rice® in their 
study of analog angle modulated transmission systems. 
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ment of the origin by the received signal plus noise vector, it is 
convenient to choose the integrate and dump model for the post-dis- 
criminator filter. Then the filter output is a measure of the angular 
change of the received vector over a symbol duration. Signal and click 
angular changes are readily compared. The particular signal angular 
modulation considered here is --7 radians; in this case an error occurs if 
and only if a click occurs, to a good approximation. ft Intersymbol inter- 
ference is considered as a perturbation of the signal modulation. This 
distortion affects the instantaneous signal-to-noise ratio and the in- 
stantaneous frequency which, as shown by Rice, are the controlling 
parameters for the click (error) probabilities. 

After the calculations described above are used to compute the error 
rate for the digital FM receiver, another computation for error rate is 
made using a noncoherent orthogonal matched filter receiver. The error 
performances of the two receivers are the same for this binary signal 
having angular modulation +-7 radians. 

In the following sections, the modulation and the channel param- 
eters are first defined. An expression is derived for the distorted out- 
put of the channel. The derivations of the FM receiver performance 
and the matched filter receiver performance are explained, then the 
significance of the work is discussed. Two appendices give detailed 
derivations of the receivers’ performances. 


II. FSK MODULATION IN THE SINGLE ECHO CHANNEL 


Since the two receivers are applied in turn to the same channel as 
shown in Figure 1, we shall first express the output of this single echo 
channel. The input waveform is either s,(¢) or so(t). 


30] Referer) Oster 
s2(t) pa Re iene 


where s;(t) = so(t) = 0 otherwise 


(1) 


f, is a center frequency 
fais the frequency deviation 
T is the symbol duration. 


(We shall consider only the deviation: 2f,7’ = 1.) These input wave- 
forms are applied in some arbitrary sequence to the channel; the 


+ Mazo and Salz4 have considered the approximations involved in some detail, 
and their work relates different angular modulations. 
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INPUT 
WAVEFORM CHANNEL 
S,(t) 
Salt) 
WHITE 
GAUSSIAN 
44,0, 


RECEIVER 





SYNCHRONOUS 
SAMPLER 


Fig. 1— Data transmission system model. 





channel input voltage can be described as 


é:() = x s,,(t — nT) = 1D; (2) 


n=—0 


The output of the channel (receiver input) with noise added is 


e(t) = Re faye’*e:,(t — 7:) + ane’? *ein(t — 72)} + en(2) (3) 
where 


a,e’** (¢ = 1, 2) are the complex tap gains and we shall consider 


that; ad 
7; are the tap delays and we shall consider the case rz — 7 = T 


ey(t) is additive white Gaussian noise. 


In the ensuing work we shall refer to the first term in the braces of 
equation (3) as the signal, and to the second term as the echo. Since 
the echo is displaced by one symbol duration, the receiver input in 
simply a superposition of signal and echo s;, s; (2, 7 = 1, 2). Because 
each combination is assumed to be equally probable, and because cor- 
responding conditional error probabilities are equal, it is sufficient to 
evaluate the cases s;, s,; and s;, se. Thus the receiver input is either 


e(t) = Re fae eee ae Gee gr) oh ey(t) 
; O<i<T (4) 
or 
e(t) = ag ee? + Pe asa va an ey(t). (5) 
lil. FM RECEIVER PERFORMANCE 


The FM receiver model used here includes a predetection filter, 
limiter, discriminator, and a postdetection integrate and dump circuit, 
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as shown in Figure 2. The predetection filter is used to reduce noise 
and is supposed to have negligible effect on the modulation. Then the 
receiver output is proportional to the angular change of the input 
modulation over a symbol duration. 

We proceed by first rewriting the input voltage, represented by 
equation (4) or equation (5), in a form that shows explicitly the 
amplitude and angular variations of the noise-free envelope in the 
form 


e(t) = Re {A(de'? ei?*""}, 


The envelope which describes the signal-echo pair s,, s,, corresponding 
to equation (4),is 


A (a; +a;+ 20,2 cos (y, — »)|} 
o(t) = Qrfat + go 


where go is a constant. The envelope which describes the signal-echo 
pair $1, S2, corresponding to equation (5), is 


(6) 


A(t) = [az + a3 + 2a,a, cos (4x ful + 1 — @)]? : 


7 — tan | Sein elt +s — 0) —] 
e(l) = 2rful + , — tan E + dz cos (Amfut + 1 — ¢2) J 


(In these equations, A(t) and g(t) have been obtained by straight- 
forward trigonometric relations from equations (4) and (5).) 

Thus in the absence of noise for signal and echo pairs 81, 81 or 81, 82, 
the receiver output is proportional to 


Ag = oT) — 90) = 2afaT = +7 (a. < a), 


Similarly, complementary signal-echo pair s2, 8s. or 82, 8, would give 
an output Ag = —7z. 

The noise perturbation is considered an additive error angle 6(¢), 
illustrated in Fig. 3. Now the FM receiver output is proportional to 


w(t) 


v(t) = ot) + Od) 


iced ey eae ee . INTEGRATOR 
PREDETECTION LIMITER T 
FILTER DISCRIMINATOR fi dt 
ie) 


Fig. 2— Digital FM receiver. 
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Fig. 3 — Angular perturbation caused by noise. 


and the angular change over a symbol duration is 


Ay = Ag + Ad. 


The decision threshold is placed at Ay = 0, midway between +7. 
When the transmitted signal has an angular modulation Ag = +7, an 
error is made if A@ < —7. 

Fig. 4 illustrates possible loci of the signal plus noise envelope FR. 
For signal alone, the locus is simply a semicircle. With echo and noise 
added, no error is made provided Ay > 0. We observe that the locus 
encircles the origin in a counterclockwise direction. But when a nega- 
tive click occurs, the locus encircles the origin in a clockwise direc- 
tion, Ay < 0, and an error is made. 

The probability of error is obtained from the probability of a nega- 
tive click during a symbol interval. Rice* defines H— dt as the proba- 






SIGNAL ONLY 


]SXSIGNAL PLUS NOISE 
Ay <0 = ERROR 






<_SIGNAL PLUS NOISE 
Ay >0=NO ERROR 


Fig. 4 — Possible loci of angular change. 
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bility of a negative click: the angle @(t) decreases* through an odd 
multiple of + between ¢ and ¢ + dt. H_ dt is a function of signal-to- 
noise ratio and the time derivative of the angular modulation, which 
are time- and phase-dependent according to equations (6) or (7). The 
desired probability of error is obtained by integrating H_ over a 
symbol duration and averaging over the random channel phase angle: 





1 Qr T 
-x[ wf Foad (8) 
where 
A*(2) . ; : 
p= a is the instantaneous signal-to-noise ratio 
€n 


e2 is the noise power passed by the predetection filter 
¢ is the time derivative of the modulation angle ¢(é) 
= 9, — ¢2 is the relative echo phase, assumed uniformly distributed 
over (0, 27). 


As Appendix A shows, the error probability obtained when A(t) 
and y(t) from equation (6) are substituted in equation (8) is 


2 2 
Pr, = 5 1 St) exp (-25 81), (9) 
2en 


The error probability corresponding to equation (7) is 


Pa = OS, As |- 5 H(S%) ew (-85%), a0 


N 








where Q[-,:] is the Marcum Q function. The average of Pe: and Pes 
is simply 








P= 4Pa + Pa) =} 50, 4s | (11) 


The noise power e2 depends on the predetection filter bandwidth, 
which can be estimated using Carson’s rule with the Nyquist criterion 
for video bandwidth. These assumptions give a bandwidth B 


B= A + 2) = 2 He (12) 





*Decrease means in a direction opposite the time derivative of the modula- 
tion y(t). It is possible that @(¢) can also increase by w and thus cancel the 
decrease; the probability of this occurrence is asymptotically negligible for low 
error rates. 
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and thus a noise power 


e; = BN watts (13) 


where N is the noise density in watts/Hz. Substitution of equations 
(12) and (13) into equation (11) gives 


r= Lofal) ( 09 
where 


a= a is the echo/signal voltage ratio 


1 


E = 14a{T is the signal energy /bit. 
IV. MATCHED FILTER RECEIVER PERFORMANCE 


We are concerned here with the incoherent matched filter receiver 
shown in Fig. 5. The mark and space filters are matched (except for 
phase) to the waveforms s,(¢) and s,(¢) defined by equation (1). As Fig. 
5 indicates, the combined operations of filtering, square law rectifying, 
and time sampling produce R? and R? which are the squared envelopes 
of the filter outputs at the end of the symbol interval. Assuming mark 
is transmitted, the probability of error is 


P = Prob {R?2 > R?} = [ dR, : MRR aR. (15) 


where p(R1, Ry) is the joint density function of R; and Ro. 
As shown in Appendix B, the error probability corresponding to 
equation (4), with the signal-echo pair s1, s; 1s 


-15(2) exp (2428) 
Py = 5 1() exp \——~oy (16) 
The error probability corresponding to equation (5), with the signal- 
echo pair 81, S218 







MARK 
FILTER 
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DETECTOR 


SPACE . 
FILTER R3 


Fig. 5— Matched filter receiver. 
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ra ole) (8) ] 218) ao (-25 08). on 


The average of P,, and P,» is 


P. = Pa + Pa) = 49 o(2), (2)'], (18) 


This is identical with the error performance of the FM receiver, 
specified in equation (14). 





V. DISCUSSION OF RESULTS 


The concept of clicks has made possible a unique comparison be- 
tween digital FM and matched filter receivers. When a suitable pre- 
detection filter is chosen for the FM receiver and the assumption made 
that this filter does not significantly process the signal, then the error 
performance of the two receivers is described identically. 

We have gained particular insight into the error mechanism of the 
digital FM receiver under conditions of intersymbol interference. The 
analysis shows how the rate of occurrence of the noise clicks is criti- 
cally dependent on this distortion of the signal waveform. This is in 
direct contrast to the usual AM systems where intersymbol inter- 
ference manifests itself by a gradual degradation caused by “eye” 
closing. 

Numerical results, illustrated in Fig. 6, show that the receivers’ 














ERROR PROBABILITY 














Fig. 6 — Error performance in single echo channel. 
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performance can be summarized in terns of the increase in signal-to- 
noise ratio required to maintain a desired error rate when an echo is 
superposed. For example, 10 dB higher signal-to-noise ratio is required 
to maintain a 10* error rate when an echo having half the signal 
power is added. It is easy to show that the asymptotic deterioration 
in performance with echo behaves as 20 logy (1—a)dB. 
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APPENDIX A 


Click Probabilities 


This appendix concerns the application of an average click proba- 
blity to FM receiver performance. The mathematical details are given 
here which relate equations (6), (7), and (8) to equations (9) and 
(10). 

It has been shown in Refs. 1 and 2 that the probability of a click 
in time dt can be approximated as* 


Hd? e dl (19) 


~~ Or 
where ¢ is the time derivative of the envelope angular variation and 


At 


2 
2ex 


is the signal-to-noise ratio. Substitution in equation (8) gives the er- 
ror probability 


Phe [¢ fe, ‘ exp | 4&9 (20) 


27 Jo Qe 


where x = 9; — ¢2. Substitution in equation (20) of A(z, ¢) and the 
time derivative of ¢(a, t), g(a, t) = 2af,, from equation (6) gives for 


* This validity of this approximation depends on sufficiently large p for p # 0. 
Klapper’ has discussed this in some detail. Although the approximation is not 
as good with intersymbol interference, it appears adequate. 
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the signal-echo pair s, , & 


Pa = dx ( fa ep |—3 a3 + a2 + 2a,a, cos 2) | a 


-41(6g)o(-#285) 
2 P 202 


where we have used the integral definition of Jo(-) and noted fa7’ = 3. 
We now take A(z, t) and ¢(z, t) from equation (7) and notice that 
the time derivative of the latter can be expressed as 


2 
; a; + a,a> cos (4rfat + ¢, — ¢2) | 
= — : 22 

a iF + az + 2a,a2 cos (Arfat + 9 — 92) ane (22) 
When the first term of equation (22) and the expression for A(z, ¢) 
are substituted into equation (20) the integral can be recognized as a 
Q function representation given by Helstrom:’ 


(21) 











2\ pan (1 — © cos u}er*orm 


where a < 8 and we have replaced 4 = fat + 91 — ¢2 + a by the vari- 
able w. 
We make the identifications 


a gee 
(ex)? ’ (ex)? 
Substitution of the second term of equation (22) gives an integral 


identical to the right-hand side of equation (21). Thus, for the signal- 
echo pair S81, Se 


Pe. = o| 5, | — 4 1( 22) exp (-S@ 8), 
(en)? (en)? en 2en 


APPENDIX B 




















Matched Filter Receiver 


This appendix concerns the application of the single echo channel 
waveforms to filters which are matched to the waveforms in the 
absence of the echo. The mathematical form is similar to the form 
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illustrated by Helstrom,’ and we give only a brief summary here to 
show the effect of the echo. 

The error probability depends on a comparison of the sampled out- 
puts, R, and Roe, of the two matched filters. Helstrom shows 


RP=xX?4+ y?, ¢=1,2 (23) 
where 

a / s@ybes OH Dal (24) 

ro= / e(t) sin (2rf;t) dt (25) 


where e(t) is signal plus echo plus noise defined by equation (4) or (5). 


f; is one of the signal frequencies 
fi =— fe + fa 
fo = te oe, fa . 


Substitution of e(¢) as given in equation (4) for the signal-echo pair 
S1, $1 gives 


xX, = an cos g; + wt cos g + In, 
Y, = —22 sin 9, == OE sin g + Ive; 
(26) 
Xe = Iv 
Y, = Iva 


where Iy1, Ive, Iy3, Iva are zero mean independent Gaussian variables 
having equal variances o? = NT'/4 for noise density N (watts/Hz). 

From these terms, we find that the joint distribution of R, and Re is 
p(Ri, Re) 








Rik, , (iC, Ri +R, + Cy 
ph, ,R.) = —- | A o ) exp G2 (27) 


2 
2 
o 20 


where 








c= 2) (28) + 0228) =o. 
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The error probability is 


I 


P / dR, / dRee@y, BR) 
(0) Ri 


1 (-<,) 
g XP 4o°]° 


Averaging this valuc over x = ¢; — ¢» yields 


ee at (az) | Bt ez | 
i age -5-/ P(x) dx = 5 1, vi) &XP | — ony : (29) 


Similarly, substitution of e(t) as given by equation (5) into equations 
(24) and (25) gives, for the signal-ccho pair s1, se 


(28) 


AX, = ue cos g, + Iy, 





= — sin g, + Ixe 





(30) 
X, = tol COS go + Ix 
Y, = aan sin g, + Tyg. 


Now the joint distribution of R,; and R. is found to be 


RR, , (RiC,\, (RC, Ri+tkR+C+C 
ptt Ra) = Mabe 1 ("HE (EE) exp (BARGES) oa 








20 
where 
c,= 48 
T 
C, = 3 : 


The error probability in this case is found via the following steps. 
Pi = ‘ dk, / dk, p(k, ? R,) 
0 Ri 


= fon EAC) (3 
0 o o o 


o 20° 


(32) 
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where we have substituted p(R,, R.) from equation (31) and used the 
Q funetion definition: 


Qa, 8) = fc iI,(at) exp (+ £) dt. 


We see, by this manipulation, that equation (82) is integrable; for 
example, as shown by Stein.® Thus 


ae ae (ae) p (-G+,) 
a Gy er ee aa ey 


Appropriate substitutions of the terms from equations (31) and (14) 
give equation (17). 
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Eliminating Broadband Distortion in 
Transistor Amplifiers 


By LEE C. THOMAS 
(Manuscript received July 6, 1967) 


This paper presents the results of a study directed toward understanding 
the basic distortion mechanisms in transistors. (1) We develop an analytic 
model for the transistor which describes small signal linear performance 
and nonlinear effects. The linear model is matched to the measured h- 
parameters of the device over a wide range of frequency and bias current. 
We superimpose three distinct nonlinear effects on this linear skeleton 
model, all approximated to third order terms. (it) We show experimental 
confirmation that, for some bias-load conditions, the second order distor- 
tion can be minimized and we show that it is possible to simultaneously 
minimize both second- and third-order distortion under the same bias-load 
condition. This result also ts confirmed experimentally. (iit) We derive 
and discuss 1n detail an analytic expression for the optimum load. Based on 
this expression, we present detailed procedures for finding this optimum 
condition for any transistor, and give experimental corroboration. (iv) We 
give a qualitative description of the interaction among these three nonlinear 
effects based on an analog computer simulation of the model. This description 
makes it easter to visualize the distortion cancellation phenomena derived in 
this paper, and indicates a technique for extending the effect to a broad band 
of frequencies. We conclude that proper use of the distortion cancellation 
effect can greatly vmprove intermodulation performance in existing tran- 
sistors. 


I, INTRODUCTION 


System studies have indicated that very broad band (greater than 
20 mHz) AM coaxial cable systems will be modulation-limited. Inten- 
sive investigations to understand and characterize the inherent modu- 
lation properties of devices and repeater circuits have been called for. 
We made one such study directed toward understanding the basic 
distortion mechanisms in transistors. 
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The history of transistor distortion literature can be characterized 
as an erosion process in which highly restricted parts of the total 
problem are attacked leaving fresh complexities exposed for future 
work. In early work by Akgun and Strutt, the analysis is restricted 
to nonlinearities in the emitter resistance assuming an ac short at 
the input and output.t Observed nulls in second and third order 
distortion do not correlate with the theory, which does, however, 
include frequency effects. Using many of the same assumptions, Mal- 
linckrodt and Gardner extended this earlier work to account for a 
third order null at low frequencies when the nonlinear emitter resis- 
tance is dominant.? 

More recently Riva, Beneteau, and Dalla Volta considered all 
important sources of distortion by breaking the problem into three 
distinct operating regions with expressions for minimizing second 
order distortion in each.? They do not treat of third order minimiza- 
tion, and they use a de model. Reynolds analyzes third order mini- 
mization at particular nonzero frequencies for dominance of the 
emitter resistance nonlinearity.* 

There are two reasons for the specialized nature of these efforts. 
First, transistors, as contrasted with vacuum tubes, have at least three 
dominant nonlinearities. It would be difficult to consider all of these 
in a general expression for second and third order distortion. Second, 
frequency effects can be important in many applications. In general, 
the analysis of nonlinear effects as a function of frequency requires 
the use of extremely powerful and, as a result, cumbersome analytic 
techniques. In the special case of an exponential input v-7 relation 
it is possible to avoid a general analysis, which explains why analyses 
which include frequency effects have been limited to emitter non- 
linearities. Even in this exponential case, however, the third order 
null predicted by Reynolds is a narrowband effect, applicable only 
at a particular frequency. 

This paper extends these earlier efforts in four important respects. 

(t) We conclude that the distortion measured at the terminals 
results from algebraic cancellation between distortion components 
produced by nonlinear effects within the transistor. This conclusion 
originated from empirical observations made on an analog computer 
simulation of a transistor. An analytic argument reinforces this con- 
clusion by comparing plots of algebraic cancellation to measured dis- 
tortion curves. Also, we give experimental support of the cancellation 
phenomenon. 
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(ti) We present a low-frequency analysis of a complete extrinsic 
model including three nonlinearities: emitter resistance, nonlinear 
current gain, and avalanche multiplication, all approximated by a 
third order polynominal. We avoid considerable complexity by direct- 
ing the analysis strictly to the question of minimizing distortion and 
by not developing a general distortion expression. This analysis is 
independent of any assumptions concerning distortion cancellation, 
but yields the same results. 

(ww) From this analysis we show that it is possible to stmultaneously 
null both second and third order distortion under the same bias-load 
condition. The analytic technique we use to determine a null is 
linearization of the input-output relation up to and including third 
order, thus implying a minimum in harmonic distortion, intermodula- 
tion, or any other specialized figure of merit. The existence of this 
simultaneous null is verified in the laboratory. 

(wv) Extension of the cancellation effect to a broad band of fre- 
quencies can be accomplished by external reactive compensation. 
This compensation maintains a 180° phase shift between the collector- 
base voltage and the real component of the emitter current, a relation 
that exists automatically at low frequencies where the rigorous anal- 
ysis is performed. This phase shift is the fundamental requirement 
for total cancellation, based on the qualitative insight mentioned in 
item 2. 


PRINCIPAL SYMBOLS 


A Parameter in the G(U/.) relation. 

a(I,) Current dependence of the dependent current source. 

@,,@,a3; ‘Taylor series coefficients in the expansion of a(J,) around 
| ioe 

1 Maximum value of a with respect to J, . 

B Common emitter ac gain. 

Baas Maximum value of 8 with respect to J, . 

I, Total collector current. 

Te Collector current where 8,,,. Occurs. 

te Small signal collector current. 

Ve Total emitter current. 

Te Emitter current bias level. 

dy Small signal emitter current. 

ty Collector current bias level. 


Current in the load resistor. 


3° 
e 
e 
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M(V.s) Voltage dependence of the dependent current source. 

M,,M,.,M; Taylor series coefficients in the expansion of M(V.,) 
around V,. 

ie Emitter resistance. 

Ti Va a Te Taylor series coefficients relating v, to 7, . 

Ry Load resistance. 

Rishon Load which minimizes second order distortion when third 
order distortion is negligible. 

R, Source resistance. 

Ver Total collector-to-base voltage. 

Veo Small signal collector-to-base voltage. 

Vv. Small signal voltage across r, . 

Y, Collector-to-base bias level. 

Vout Voltage across the load resistor. 


II. A QUALITATIVE MODEL FOR THE DISTORTION MECHANISM 


Let us describe the qualitative insight (7) to get a broad look at the 
cancellation phenomenon before rigorous analysis obscures a simple 
concept. 

An analog computer simulation of the model of Fig. 1 allows us to 
examine the interaction of the three nonlinearities by examining their 
effects one at a time. Thus, for example, we may allow only a(I,) to 
be nonlinear and observe the second harmonic distortion components 
of the output voltage. If we then make @ constant and allow M(V,,) 
to vary, we observe that the resulting waveform is 180° out of phase 
with the first waveform as shown in Fig. 2. This is plausible since 
V., and I, are inherently 180° out of phase at low frequencies. Thus any 
cancellation that we obtain between current dependent nonlinearities 


Che 





Fig. 1— High frequency nonlinear model. 
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Fig. 2— Cancellation of distortion components. 


and voltage dependent nonlinearities will not require phasing I, and 
V.» properly, but will result from properly adjusting the relative 
magnitudes of V,, and I, . 

Since IT, ~ I,u., the most direct way to adjust the magnitude of 
Vier © Vout relative to J, is to change the load resistance. Hence the 
strong dependence of distortion on R, as shown in Fig. 3 for a fixed 
bias level of V., = V, and J,u, = I, . To obtain cancellation in second 
and third order distortion at the same time, not only the relative 
magnitudes are important but the absolute level must be correct. 
This cancellation model explains the sharpness of the null: since the 
net distortion is a small difference between large distortion components, 
a small percentage change in the ratio of the larger components will 
yield a large percentage change in the difference. Experimentally, as a 
null is passed the output distortion waveform changes phase by 180° 
as we would expect from one component’s becoming dominant over 
the other. 

It is important to notice that this cancellation effect is not some 
artificial phenomenon that we are forcing to occur. According to the 
model presented here, some degree of cancellation always occurs in 
any transistor at any level of distortion. We give a more quantitative 
argument supporting this exact cancellation model for visualizing the 
transistor distortion mechanism in Appendix C. 

It has been the author’s experience that a disturbingly large per- 
centage of published technical material is exclusively concerned with 
presenting conclusions. In most cases, these conclusions were arrived 
at by the rigorous manipulation of symbols long after the original 
insight which prompted the investigation. The purpose of this section 
is to describe the insights first in the belief that the reader will have 
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Fig. 3— Experimental null in second harmonic distortion as a function of Ru, 
using a Western Electric 20J transistor with V. = 30 volts, 7, = 100 milliam- 
peres, and #. = 500 ohms. 


at least one less handicap if he is allowed to see the simple ideas 
on which the rather interesting conclusions of this paper are based. 
These ideas are: 

(t) The nonlinearities of the transistor (including some, such as 
the base spreading resistance and the diffusion capacitance, which 
are not considered in this paper) are dependent on the emitter cur- 
rent, I,, and the collector-base voltage, V.,. At low frequencies JI, 
and V,, are 180° out of phase. 

(wz) As a result of this phase difference, distortion components 
resulting from these independent variables will subtract at low fre- 
quencies. | 

(wz) On an analog computer simulation, we observe the ability to 
extend this subtraction effect to the extent of total cancellation by 
manipulating external circuit parameters. Thus it should be possible 
to analyze a low frequency model by imposing the condition of zero 
distortion and solve for the required circuit parameters. We would 
expect the load resistance to be an important parameter in this anal- 
ysis since it determines the ratio of Vp to Tg. 

(tv) Considering the low frequency phase difference between J, and 
Vp as the most important factor in achieving total cancellation, we 
suggest a technique for extending the low frequency results to a broad 
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band of frequencies. This extension is achieved by the simple expedient 
of compensating the load to achieve a constant real part, Rz, and still 
maintain the proper phase between V,, and J, as frequency increases. 

The following sections develop the rigorous analysis (most of which 
is relegated to Appendix A) and examine in some detail the analytic 
conditions for a null and the implications of these conditions in the 
area of circuit and device design. 


HI. TRANSISTOR MODEL 


The model in Fig. 1 has been matched closely to the h parameters 
of the Western Electric type 46A transistor over a wide range of 
frequency (5 to 100 mHz) and bias current (50 to 150 mA). Figs. 
4 and 5 show a typical match, obtained from a general purpose 
optimization program. Three distinct nonlinear effects were then 
superimposed on this small signal linear skeleton model. The current 
dependence of the dependent current source is changed from al, to 
the expansion around the emitter current bias point, Io, 


a(f,) =I, +a,7, — I.) 
2a2(l, ~ Tay ale ga3(1. = ia3) + ad (1) 


where I, is the quiescent collector current. The voltage dependence 
of the dependent current source is changed from the constant, M, to 
the expansion around the collector-to-base bias voltage, V4, 


M(V.) =1+ MAV., — V.) 
i aM AV co = V.) a 6AT3( Coe V.)° 5 ee (2a) 





Fig. 4— Linear model for Je = 150 mA, Ver = 10V. With the indicated ele- 
ment values, this model matches the measured | h-parameters shown in Fig. 5. The 
quality of the match at this bias point (7. = 150 mA, Vos = 10V) is typical. 
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Fig. 5— Magnitude and phase, as a function of frequency match to (a) hun, 
ne he, (c) ha, and (d) hee. Solid lines were measured; points were plotted from 
the model. 
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So that the total dependent current source relationship is 
I, = al.) M(V.). (2b) 


And finally the emitter resistance, r-, is replaced by the expansion 
around J,, 


vu. = Till a Tes) Sa ar2(I, Tie)" A éra(I. = I. : ale ne (3) 


where the coefficients in equations 1, 2, and 3 are the corresponding 
derivatives of the Taylor series expansion. Define the small signal 
quantities as 


i,= 1, ~~ J, (4) 
i, = 1, = Te (5) 
vo = Vea — Vo- (6) 
Using these relations, equations 1, 2, and 3 become 
aI.) = I, + at, + fazte + fast, + ++: (7) 
MV) =1+ Myo + 3Mwe, + SMe + -°- (8) 
Ve = Tit, + dete + Brste + oe (9) 


Substituting equations 7 and 8 into 2b, and retaining third order terms 
I, = (I, + at, + fa2te + fasts) 
(1+ Myo + $Me, + §Msv.) (10) 
I, -1, = 4, = at, + I,.Myvo + tart. + 31.Move + aMyine 
+ fa Movieet. + fo2M ete + goste + §1.Me, (11) 


At this point we have developed a model for the transistor, indicating 
the nature and form of the particular nonlinearities considered in both 
the analog computer simulation of the complete, frequency-dependent 
model of Fig. 1 and the analysis of the de model of Fig. 6. 


IV. THE ANALYSIS 


4.1 Optimization Equations 


An analog computer simulation of the complete, frequency-depend- 
ent model just discussed suggests that a simpler model is sufficient 
to describe the distortion characteristics of the transistor at low 


324 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1968 


Ig (Ie Veb) 





Fig. 6 — Low frequency nonlinear model. 


frequencies. Fig. 6 shows this simplified dc model. The following 
analysis of this model is detailed in Appendix A. 

(t) The incremental output voltage, Vout, is related to the input 
voltage, Vg, retaining third order terms as in equation 11. 

(ii) This input-output relation is constrained to be linear, thus 
forcing both second and third order distortion to zero. 
_ (wi) This constraint requires certain coefficients in the nonlinear 
Vout (Vg) relation to be zero. These coefficients are, of course, func- 
tions of the linear and nonlinear parameters of the system. Thus, 
when these functions are made zero, Vout is a linear function of Vg 
(to third order), and the derivation of the optimization equations is 
complete. These equations are: 


—Ri(I,.M:2) + R,(2a,M,) + 6 -— Aa >= 0 (12) 
z(1.Ms) = Ri,(80,M?2) +&—a, =0 (18) 
where 
6=r/(R, +r) <0, since r, < 0, (equation 19), (14) 
and 
&£=7;/(R, +7i) > 0, since rz > 0 (equation 20). (15) 
For the simpler case where the amplitude of third order distortion 
is sufficiently low so that third order terms are negligible, equation 
13 is satisfied identically and only equation 12 remains, which is 
easily solved to yield 
Rea) ove = 4M ,/I.M, + ((orMi/1I.M.)? — (a2 — 8)/I,M.}*. (16) 


Thus FR 2)opt 18 the value of load resistance which causes second 
order distortion to be zero for the case where third order terms are 
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negligible. Notice that the analytic technique used to determine a 
distortion null here is linearization of the input-output relation, and 
thus implies a minimum in harmonic distortion, intermodulation dis- 
tortion, or any other specialized figure of merit. Of course, R(2)opt 18 a 
function of bias current and voltage because of the dependence of 
M, and Mz on voltage and re, a; and a. on current. The implications 
of equations 12, 13, and 16 become more clear when the dependence 
of these parameters on bias is considered. 


4.2 Relating Parameters to More Directly Measurable Quantities 

It is revealing to express the parameters of equations 12 and 13 
in terms of the bias variables and other directly measurable param- 
eters of the transistor. 

Assuming the standard exponential 1-v relation at the emitter-base 
junction we can immediately derive from 


I, = I,lexp QqV./kT) — 1] (17) 


the following relations: 


rf, = kT/\ol, = 7./T, , (18) 
tr = —kT/d\qI? = —r./T , (19) 
ry = QT /dgl® = 2r./E . (20) 


Similarly, if we assume that the avalanche effect in the common- 
emitter mode is described by an equation of the same form as Miller’s® 


M(V.s) = {1 = (Vis/Va) Il” (21) 


where V4 is the common-emitter breakdown voltage as shown in Fig. 
7. Then, at V., = Vi: 





Vs VA Vce > 
aM(Vs)=1 M(Va)=0 


Fig. 7-— Avalanche characteristics, 
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M, = MV ./Va)/V ) (22) 
M, = n(n — 1)(V./Va)"/Ve ’ (23) 
M,; = n(n — 1)(n — 2)(V./Va)"/V2 . (24) 


The avalanche voltage, V,, can be determined on a curve tracer 
oscilloscope by leaving the emitter open-circuited in a grounded base 
configuration and sweeping the collector-base voltage. The sustaining 
voltage, Vg, shown in Fig. 7, is obtaining with the transistor in the 
common-emitter mode and at least enough base current flowing to 
produce Igmin at the output. At Vg the avalanche factor M(V.») has 
increased above unity sufficiently so that a(I¢)M (Vs) = 1. As a result 
the common-emitter current gain (8) at this voltage is infinite. Choos- 
ing the smallest « at which this occurs (amin) allows us to determine 
the exponent, n, in equation 21: 


min (V s) =l= Onin L = (Vs/Va)'T'. (25) 


Therefore 


n & log Bnin/log (Va/Vs); (26) 


where Bmin corresponds to amin and may be determined from equation 
27 using I, = Iomin. Notice that equation 21 constitutes an empirical 
relationship in this study and is not intended to be rigorously tied to 
any one of the various avalanche mechanisms. It is apparent, too, 
that the measurements determining equations 25 and 26 will be influ- 
enced by other voltage-dependent mechanisms (for example, the 
Early effect); hence they are not strictly related to the avalanche 
multiplication effect alone. Equation 21 has the virtue of mathematical 
tractability; equation 25 allows the parameters of 21 to be determined 
conveniently; and, finally, the excellent experimental agreement with 
the theory described in Section V provides adequate justification of 
the original assumptions. In any case, the derivation of equations 
12 and 138 is based on a general power series expansion for M(V,») 
around V,; hence it remains valid for any M,, Mo, and M3. 

Finally we require a2 and a3. We show in Appendix B that @ can 
be empirically related to collector bias by 


B= Bae/ll Pp Alo, C,/13,)] (27) 


where Bmax 18 the maximum 8 which occurs at I, = Ip, aS shown in 
Fig. 8, and A is a parameter of the equation. Determination of az, 
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Fig. 8 — Current gain nonlinearity as a function of the bias current, I<. 


az, and A is derived in Appendix B. They can be expressed as 








mw _ (4Aainax I, 
oe - (Ae fn (7) ee 
pw. 6Aakax Tt; 
a (Ace a (4) : a 
A = (= pe a( =) |/bue (30) 


so that A may be determined by finding I,, and measuring £ at that 
current and at 1/e times that current. Thus equations 18 through 30 
give the functional relations for the various parameters in equations 
12 and 13 and indicate the method of measuring the more funda- 
mental parameters such as » and A. In the next section we use these 
relations in existence conditions for a simultaneous null, in order to 
guide an experimental search for this condition. 


4.3 Hxistence Conditions for Realizability 

While the simultaneous solution of equations 12 and 18 has not 
been accomplished in closed form, it is possible to derive the condi- 
tions under which a solution exists. Expressed in terms of the bias 
variables, such conditions can then be used as a guide in an experi- 
mental search for simultaneous nulling of second and third order 
distortion. 

Basically we require Rz to be real and positive. For the second 
order equation, solved in equation 16, this simply requires that 


(cM, /IeMs)? = (a, — 8)/T.Ms. (31) 
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The condition for the existence of a positive, real solution to a cubic 
of the form 


2 — pe’ +r =0 (32) 
where 
p = 3a,M,/I,M; (33) 
r= (€ — a)/I,M; (34) 
x= Rk, (35) 
is easily derived. Basically require 
| “= px —r. : (36) 


Now, from equations 34 and 29, r > 0 for I, < ely. Thus, at « = 0, 
the parabola on the right side of equation 36 will be below the cubic 
on the left. There will be a positive intersection only if the equation 
is satisfied before the cubic term begins increasing more rapidly 
(larger slope) than the parabola. The slopes are equal at 


x, = Rp. (37) 
Therefore require 
ve, Spr (38) 
or 
erp ZY. (39) 


Expressing this existence condition in terms of the problem variables 
and rearranging terms gives 


(a,M].)° = 4(E — as) (I,M5)”. (40) 


Substituting in equations 31 and 40 with 18 through 29 and arranging 
terms we obtain 


(V./Va)” > the Greater of [Q: , Qo] (41) 
where 
1\| x, ; 4 Adinax Te. 
a= (1- Di (@, +r) ~ (58) in (i) | oe 





Q = [= (f, aber = (24ee) a (4) ty 8) 


For most ranges of parameters and bias variables Q; > Qe, thus, we 
will examine the condition 
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(> (-DiR@ +o - Ges) (e)| a 


in greater detail. This distinction between Q, and Qe is not critical, 
however, because they are similar in form. Thus, many of the qualita- 
tive considerations to be developed in the next section are the same 


for Q; or Qe. 


4.4 Searching for a Simultaneous Null 








A careful examination of the existence condition (44) is useful 
in guiding an experimental search for a simultaneous null. Starting 
at the left side of the inequality, it is obvious that the bias voltage, 
V,, must be as large as possible relative to V,. Since, in any case, 
V, < V4, the exponent, n, should be as small as possible. The value 
of n, according to Rogers,> depends on whether the collector or base 
has the higher resistivity, and whether the high resistivity side is n 
or p type. 

Where the collector has the higher resistivity, the lowest values of 
n are for npn silicon, and for pnp germanium. A second, less impor- 
tant, advantage of small n is that the multiplier on the right side of 
the inequality is reduced. The first term in the brackets tends to be 
the major contributor to the right side of the inequality and is there- 
fore the term which is most desirable to reduce. This term, which 
represents input distortion resulting from a nonlinear emitter resis- 
tance, can be reduced by increasing the bias, Z,, and by increasing 
R, to approximate a current source drive, thereby reducing input 
distortion. 

The second term in the brackets will favorably reduce the right 
side of the inequality only if the logarithm is positive. This will be 
true if the bias current, J,, is greater than J,,, which is consistent 
with the earlier requirement for a larger J,. Finally, the multiplier 
X, in equation 18 should be small in order to reduce 7p. 

Thus, it appears that the most likely candidate for a simultaneous 
null is a silicon power transistor to allow large values of I, and V,. 
The structure should be either pnp or npn, depending on which type 
gives the smaller n. 


V. EXPERIMENTAL PROCEDURE AND RESULTS 


Let us illustrate the application of these existence conditions in an 
experimental determination of Ri2)opt aS well as a simultaneous null 
in second and third order distortion. 
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As a fundamental check on the theoretical results, we decided to 
determine the accuracy of equation 16 with 19, 22, 23, and 28 sub- 
stituted for the Taylor series coefficients. Also, it was desirable to 
verify the existence of a simultaneous null using the existence condi- 
tions of the previous section. Because of the low frequencies involved 
(input frequency of 1 kHz), the simplest approach was to simulate 
the measurement apparatus on the analog computer, using the same 
oscillator and bandpass filters already available on the original simula- 
tion.© The transistor used was the Western Electric 20J, and npn 
power transistor. 

Using this equipment, the parameters of the B(I,) characteristic 
curve of the transistor were measured: 


Bmax = 78 
f,, = limA 
B(I.,/e) = 73. 


From a curve tracer oscilloscope, the avalanche parameters were 
determined: 


Va = 60V 
Vs = 385V 
Bmin = 40. 
These measurements yield the information to compute 
n=7 
A = .064 


from equations 26 and 30. From the manufacturer’s data, r, = 50 mV 
and ri = 50 Q. The output power was maintained at one watt. 

These parameters give all the information required by equation 16 
to compute the function Ri) ..(7.) for various values of V,. The 
curves in Figs. 9 and 10 show this computation compared to the plotted 
points which were measured. The agreement here is quite adequate. The 
quality of the match is further emphasized by comparing the computed 
values of Ria) op: Indicated in Fig. 3 and Fig. 11 to the measured nulls. 
The computed value shown in Fig. 11 is based on a solution to equation 
16 only. 

A typical simultaneous null obtained in the laboratory is shown in 
Fig. 11. This data indicates the high voltages (to emphasize avalanche 
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Vig. 9 Measured (plotted points) and computed values (curve) of Reon: as 
a function of bias current, Jo, using a Western Electric 20J transistor with Rs = 
500 ohms and V, = 29 volts. 
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Fig. 10 — Measured (plotted points) and computed values (curve) of Reop: as 
a function of bias current, Jo, using a Western Electric 20J transistor with 
Rk, = 500 ohms and V, = 25 volts. 
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Fig. 11—Simultancous null in second_and third harmonics as_a function of 
load, Rx, with the same transistor and R, as in Figs. 9 and 10. Here Vo. = 29 
volts and IT, = 240 mA. 


distortion) and currents (to minimize input distortion) required for 
a simultaneous null. Conditions for a simultaneous null exist on an 
(Rz,Jo,V>,Rs) surface, giving some redundant control to achieve 
desired power and impedance levels as well as minimum second and 
third harmonic distortion. It is apparent that a transistor manufac- 
tured with a lower value of n would allow a broader range of control 
over the bias voltage and current level required. Measurements on 
different units of the WE20J show a maximum spread of £10 per 
cent in measured values of the optimum load for a simultaneous null. 

Experimentally, as R; is varied, the second harmonic displayed on 
the oscilloscope decreases in amplitude, goes to zero, and begins to 
increase in amplitude. As it goes through a null, the second harmonic 
changes sign, giving additional weight to the qualitative distortion 
model discussed in Section IT. 


VI. EXTENSION OF CANCELLATION TO A BAND OF FREQUENCIES 


Up to this point, our discussion has been limited to low frequency 
effects. Now let us consider why the above results do not apply at 
high frequencies and look at a straightforward approach to extend 
the validity of all previous results to a broad band of frequencies. 
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If we accept the qualitative picture given in Section II, it is obvious 
that we should not expect to maintain exact cancellation as fre- 
quency increases, since the phase of V,, relative to J, will change. A 
small change in phase will have an effect similar to changing the 
relative magnitudes of the current and voltage-dependent distortion 
components: the amplitude of their difference (the net distortion) 
will change by a large percentage near a null. In fact, at higher fre- 
quencies (on the order of f7/100), the null of Fig. 2 vanishes alto- 
gether. It is apparent, then, that a solution to this problem is to apply 
external reactive compensation in such a way as to keep Vy and I, 
180° out of phase as frequency increases. In the model shown by 
Fig. 6 if we consider a capacitor, Cp, in parallel with r,, it is straight- 
forward to derive the relation, 


—V./I. = Re {21} + 8 + Im 2.1 | ’ (45) 


Wr 


where 


1 


Or = . 
Cor. 


Ideally, we would desire Z, = Ry, — jwr{/w, but this would require 
a negative inductor. A simple first order approximation to this function 
would be to parallel R; with a capacitor, C. Then ~ 





(46) 





Zi = ee — jw (tee (47) 
From equation 47 choose 
Cont = 71/Riwr . (48) 
Now 
hore, Slee} 
For small angles the phase is given by 
y(w) = w'(r{/Rier)’. (50) 


Thus the phase is reduced below the uncompensated case up to the 
frequency 


_ fiw 
Wmax — r = 
b 
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At which point the cubic dependence of equation 50 intersects the 
linear phase of the uncompensated transistor. 

Obviously additional compensating elements can be used to cause 
higher derivatives of (w) to be zero. A complication may arise if 
Copt 18 less than the parasitic Con of the transistor. In this case we 
extend the required low-pass structure of the compensating network 
to include an inductor in series with R,. In this case the required 
inductance is given by 


2 : 
Lot = RiCce — th/wr 


which is greater than zero for Coz > 1ri/Rjwr = Con. 


VII. CONCLUSION 


Our conclusions are based on simulation of the transistor on an 
analog computer, analysis, and experimental results. The rigorous 
analysis predicts the existence of a simultaneous null in second and 
third harmonic distortion under the same bias-load conditions. This 
null has been observed in the laboratory. In addition, experiments 
on the simulation provide qualitative insight into the nature of the 
distortion mechanism. 

We conclude that this mechanism consists of the algebraic sub- 
traction, at low frequencies, of distortion components from various 
sources within the transistor such as the nonlinear emitter resistance, 
current gain, and avalanche multiplication effect. This interaction 
between distortion components yields a net distortion which is the 
difference between the contributing components, and can be made 
zero by a proper choice of the bias and load. 

With this mechanism in mind we developed a technique for ex- 
tending the cancellation phenomenon to a broad band of frequencies. 
This technique consists of external reactive compensation which 
maintains 180° phase shift between the distortion components, a 
condition which exists inherently at low frequencies. 

We have obtained experimental confirmation of the theoretical 
dependence of the optimum load for second order distortion on bias 
variables. The theory predicting a simultaneous null in second and 
third order distortion has been confirmed. We have also obtained 
experimental support for the distortion cancellation phenomenon. We 
discussed methods to aid future measurement efforts in implementing 
this distortion reduction phenomenon. These methods are based on 
interpretation of the theoretical expressions developed in the paper 
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which reveal the necessity for high levels of bias voltage and current 
to obtain a simultaneous null. 

This study opens several fruitful areas for future work, both in 
device and circuit areas. Primarily, the phenomenon described uses 
circuit techniques to minimize distortion (optimizing the bias-load 
point). Additional effort in the circuit aspects of minimizing distor- 
tion should be directed toward desensitizing the null condition to 
variations in the bias-load point. For example, if the bias current is 
forced to change with R, as shown in Fig. 9, optimum conditions 
could be maintained over a range of changes in the load. 

In the realm of device design, effort should be directed toward 
adjusting device parameters to allow nulling in useful regions of the 
bias-load space. For example, a softer avalanche characteristic (lower 
value of n) would allow the use of lower bias voltages. 
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APPENDIX A 


Derivation of the Optimization Equations 


In Fig. 6 the following relations hold 


t= t +7, (51) 
% = (V, — ».)/(R, + 7%) (52) 
Vous = —Rrte FY ver. (58) 
Let 
V/R. +1) = 1 (54) 
m/(R, +r) = (55) 
m/(R, +11) = 6 (56) 
r3/(R, + 11) = &. (57) 
Substituting (9) and (54) through (57) in (52) 
, = 1 — yi, — 462 — 3&r. (58) 
Combining (51) and (58) 
i, = 1,1 + y) + 360; + G8) — I. (59) 


Now 2, is given by equation 11. Therefore 
tx ey) On ee ee, 
=a,t, + I,My., + dat? + 41,Meve, + aM yt.ve, 
+ $aMovet. + Fo2Mywete + gaste + g].Mav,. (60) 
Substituting (51) and (53) into (60) and gathering terms: 
A[RT.MsR3 + daM.R, — 40,M,Ri, — tos + 44] 
+ i[4 6 — 4a, — 47,M.R, + a,M,R, 
+ 03 — furMRT — 3a2.M Ry — as)] 
+7(1 +7 —a,+ 7,M,R, + 1,06 — a + a AI,R,) 
+ iE — 303 + 3a.M,R,)) + 2,[1 + ¥ — a] 
+ &[3 6 — gas] + HL—8é — Ge,] —I = 0 (61) 
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The only approximation that we have made up to this point is that 
Vout XY V.,, the collector-to-base voltage, assuming that v, is small. 
Now we would like to express the variables of (61) in terms of the 
independent driving voltage, V,, and the output current, 7, , which 
is linearly related to the output voltage. To accomplish this, we start 
with (58) and make the approximation | 


i, @I — yi, (62) 
where we have ignored the high order terms in (58) and used the linear 
relation 7, 7, . 

Notice that (62) certainly does not imply that we have fixed a linear 
relationship between 2,, J, and 7,. We are simply using this new ap- 
proximate variable in the highly nonlinear (61) for convenience. The 
approximation is justified by the fact that second order and higher terms 
ignored in (62) would appear as fourth order and higher terms in (61). 

Substituting (62) into (61) we have 
ie {él .M3Rz, — ga3(1 — 1)” 
+ (1 — y)* + da2M,Ri(1 — yy — $M2Ri(1 — y)} 
+ a{3 61 — y)*? + MRL — y) — 31.02Ri — f0n(1 — y)’ 
+ IlonM,R,(1 — y) + 3&1 — y)? — gas(1 — 7)’ — $a.M2R7)} 
eho) = ely) pikes 
+ I[é6(1 — y) — a(1 — vy) + a@,M,R;] 
+ I$ — y) — $as(1 — y) + $e2M,Rz)} 
+ Ily — a] + 1°13 6 — $a.) + PEE — gas] = 0. (63) 
At the 100 mA bias levels where we are assumed to be operating, 
r, S 0.5 Q. Also rf & 10-20 Q and R, can only increase the R, + 7 
sum in (55). Hence y < 1 and will be ignored in (63). Thus we have 
effectively substituted J for 7, in (61) to obtain (63). This substitution 
is not justified by requiring the assumption I > 2, in (62) (that is, 
a current source drive); but is justified on the grounds that the sub- 
stitution of (62) into (63) did not generate new terms in (63) for y < 1. 
Equation (63) is of the form 
ae titbtet+ifdt+el +f’) + ol +hP +5 =0. (64) 


Now to force linearity we would like to require 


Vous = EV, 5 where k is a constant. (65) 
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But from (53) and (54), (65) can be expressed in terms of the 
variables of (64) as 
_ _kR, +1) 
1, = Re I 
Substituting (66) into (64) and gathering terms 
I’{aB* + cB’? + fB + 4] 


+ (dB? + eB +h] + I[dB + g] = 0. (67) 
Now I is an independent variable so that this equation can hold only 
if each coefficient is simultaneously zero. In the linear term 


= BI. (66) 


dB +g=0 (68) 
eames! 
i d 
Ignoring terms in y and noticing that I,M@,R,; < 1 in (63) 
tw ee.) eee 
BT ae eR: ve 


The constant B should be easy to identify. For small M, (low levels 
of V.), B = B,. However, at the higher values of 2, and V,, 1,.M,R1 
can be on the order of (1 — a,). Thus, roughly speaking 


B2z > 1. (70) 


Substituting (69) into (67) our final coefficients to be equated to zero 
in (67) become 


aB’ + cB’? +jfB+ 7=0 (71) 
bB’ + eB+h=0. (72) 
Substituting for a, c, f, 7 in (71) by comparison between (64) and 
(63) ; ignoring terms in y: 
B*gl.MsRi — gas + 3 + Fa2M,R, — 30,M2Ri] 
ate Bla.M Ry, + 3& — fas — 30,M,R7) 
+ Bigt — ga3 + za. Ry] + [3 — gas] = 0. = (78) 
Gathering terms in Ry: 
RY [Ma][3B"] + Bi[—cyM,J[B* + B’] 
+ Ryla.M,\[3B° + B + B] + 3B’ + 4B 4+ 3 
— a,[3B° + $B? + 4B + 4] =0 (74) 
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Using (70) we can ignore lower powers of B, and a2M,, being the prod- 
uct of second order terms, is very small compared to the other coef- 
ficients in (74). Thus (74) becomes 


(I,M;)R2 — R2(8a,M,) + £ — a = 0. (75) 


Now substituting for b, e, and h in (72) by comparison between (63) 
and (64); ignoring terms in y: 


Bk 86 — dae + a,M,R, — 41,M.Ri] 
+ Blé — a. + a,41,R,] + [$ 6 — da.) = 0. 
Gathering terms in R,: 
Ri{—-1.M.]{3B"] + Rrla,M,j[B’ + Bl 
+ o§B" + $B] — «(3B°+ 3B) =0. (76) 
Using (71), (76) becomes 
—(,M.)R; + (2a,M,)R, + 6 — a, = 0. (77) 


Equations (75) and (77) are the relations that must be satisfied to 
satisfy (67), which in turn results from the requirement of a linear 
input-output relation, (65). 


APPENDIX B 


Relating Current Gain Nonlinearities to the Bias Current 

Riva? has shown that the small signal gain of a transistor can be 
closely matched to an expression of the form 
B = | en: login Gat. ‘iax) ale 2a logio € logyo ones ae) + iA ee (78) 
Where 


remax = Maximum de current gain 
= collector current bias where hyemax OCCUTS 
a constant characteristic of the transistor. 


I. max 


a 


I] 


Differentiating the denominator of (78) reveals that the maximum ac 
current gain (Bmax) occurs for Ip = Icmax/e. Call this current I,,. Then 


B = Ryemaxl@ logio (-/I.p) — a login e + 1]7°. (79) 
At the peak in the B(J,) curve, I, = I), and 
Bmax = Mtemax/(1 — a logi, e). (80) 
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Substituting for remax In (79) 











B = Bmaxll + a(1 — @ logiy e)™* logty (Ie/Ten)]™. (81) 
Then, for 
= a(1 — a logy e)* login ¢ (82) 
B = Binax/ [1 + A In? jia)\ (83) 
B Qmax 
eS SS . (84) 
1+ 
B 1 + AQmax ln? (7) 
max T., 
Where 
AI, =a Al, = ( He ng +144 ap) AI 
ee irae a a | aan) mae a 
= doe 724 Lda rs 
AI, =a, AI, + dL, Ali + 9 di Ar? 
=a, AI, + 40, AI? + 1a, AI? . 
Thus 
d 
a, = 2 a, (85) 
and 
gO | pon 
a; = 3 oP = 1.5 ae (86) 
Now, taking J, © I, , from (84) 
wy = Allg I Ua Fa) 67 
1s ee E a AQinax In? (i) | 
Dice Lies 
at I, = I,. In essentially all cases 
0.04 < fe < 25 
ye 
Bmax > 30 
Adinax < 0.15. 
Thus, to within 10 per cent in the most extreme case 
at & —(4Aainax/Bmaxlo) In (Lo/Tcz)- (88) 


Then, from (86) 
a = (6Aaiax/ Brnnel : In J o/el ae (89) 
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Now to solve for A, notice that, from (78) 


a( Ze) = Ntemaxl@ log’, e — 2a logis ¢ logis e? + 1] (90) 


= grace 

But from (82) and (90) 

ae = Bus + A). (91) 
Therefore, 

A = (Bo.3 ~~ | peeren Vd (reer (92) 

where Rtemax May be measured at 

i, = I, sane. a I.,/e. (93) 
APPENDIX C 


The Qualitative Distortion Model 


The purpose of this appendix is to support the qualitative picture 
of algebraic distortion cancellation given in the text. The development 
here is not intended to be rigorous, but rather to strengthen the reader’s 
ability to share the author’s insight into the cancellation mechanism. 
We have argued that the net distortion current, D, is the algebraic 
difference between positive and negative distortion current compo- 
nents, A and B, dependent on output voltage and current, respectively. 
Express this relation as 


D=A—B. (94) 


But, for A and B monotonic in voltage and current, the ratio A/B is 
a measure of the load. Define this measure as 


A 


R= 3" (95) 
Now 
D = B(R — 1). (96) 
On a dB basis 
20 log = = 20 log |R—1 |. (97) 


Fig. 12 is a plot of 20 log |R-1| as a function of R. Compare this plot 
with that of Fig. 2, which was measured in the laboratory. The simi- 
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Fig. 12 — Decibel measure of the small difference between large numbers. 


larity between the nature of these two minima adds additional weight 
to the idea that exact algebraic cancellation is involved in producing 
the net distortion frequencies. Thus any dependence of distortion on 
frequency should be compensated at distortion frequencies and not at 
input frequencies, since it is at the distortion frequency that cancella- 
tion takes place. 


Identification and Synthesis 
of Linear Sequential Machines 


By P. J. MARINO 
(Manuscript received July. 14, 1967) 


This paper considers the identification and synthesis of linear sequential 
machines from their state transition tables. Necessary and sufficient con- 
ditions for linearity are derived which form the basis of identtfication tests. 
A sufficient condition leads to a method for coding the system’s state vectors 
in a fashion consistent with linearity but which does not entail trial and 
error. The coding process is analytic in nature and allows the coding of 
state vectors independently of the coding or linearity of the output table. 
Both the Moore and Mealy models are considered in deriving coding pro- 
cedures for the input and output vectors. 


I. INTRODUCTION 


This paper develops a method for identifying and synthesizing 
linear sequential machines using their state transition table representa- 
tion. The basic objective is to construct a procedure which can be 
efficiently implemented by a digital computer. Towards that end, we 
develop simple and easily used preliminary tests which reject non- 
linear systems to precede the time consuming synthesis, or state cod- 
ing, process. The method for the coding of states is completely analytic, 
with the result that trial and error processes are not required. 

Consider the symbolic state transition table, Table I. 

The input vectors, u, have m components (2 S$ M S 2”),* and the 
next state vectors, s;, , and present state vectors, s; , have n unspecified 
components (NV S 2”). The vector components are defined over a 
modular field, and here this field is taken as GF(2). Most of the results 
obtained below can be easily extended to other prime fields. 

In terms of the state transition table, a linear sequential machine 


* This paper considers tables which have at least two distinct columns of next 
states (nonautonomous systems). 
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is defined as a system which changes state according to the equation 


Si. = As; - Bu, ° (1) 


A and B aren X n and n X m matrices, respectively.* A linear se- 
quential machine is called fully linear when its symbolic output vector, 
Zie, obeys 


Ziz = Cs; + Du, (2) 


where C and D are matrices of proper size. 

When D is the null matrix, the last two equations represent a Moore 
model of the linear system. Otherwise, the equations describe a Mealy 
model. Cohn and Even have given a method for model conversion in 
linear systems.? 


TABLE [ 

Uy, Ug oc? Up octt Us 
8S: Si. Spo ttt Syn tt Saar 
S82 8a, Sax» ttt Sar ttt Soar 
8; Sin Sig ttt Siz tt Si 
Sy Sar Swe *'') Swe *c¢) Shar 


In recent years linear sequential machines have been studied ex- 
tensively. The motivation for this activity stems from two sources. 
Not only do linear sequential machines exhibit interesting mathemati- 
cal and theoretical properties, but they have found a wide range of 
practical applications; for example, memory addressing circuits, com- 
puting over finite fields, counting and timing circuits, error correct- 
ing codes, encoding and decoding circuits, and generating pseudo- 
random and minimum-time test sequences. 

As persistent research led to greater understanding, several investi- 
gators developed synthesis procedures for linear sequential machines. 
Davis and Brzozowski? have reported a method for the synthesis of 
nonsingular systems (systems in which, under each input, every pre- 


'* The addition and multiplication operations are modulo 2. Also, the entries 
in all matrices are from GF (2). 
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sent state goes into a unique next state). Their technique is based 
upon an iterative search over partitions of the system states. 

In a mathematically elegant treatment, Cohn and Even! have 
derived a synthesis procedure which is free of trial-and-error processes. 
Coded output vectors are used to generate the state vector codes. Not 
only is it necessary that the system have a linear output, but the 
more severe restriction that the output vectors have been given, a 
priori, a linear coding is also required. 

Yau and Wang® have disclosed a synthesis technique which does 
not require a linear and coded output. The construction of the A 
matrix by examination of a transition graph, which describes the state 
transitions owing to a given input, leads to the coding of the state 
vectors. The method requires the system to have 2” states. When N < 
2”, a sufficient number of “don’t care” states are introduced to com- 
plete the state transition table; however, no suitable procedure is 
given for the specification or coding of the “don’t care” states. The 
lack of complete freedom from trial-and-error routines is another dis- 
advantage of the method. 

In this paper, necessary and sufficient conditions for linearity of the 
state transition table are derived which lead to the development of 
the procedure for coding the state vectors. The method accommodates 
linear systems in general (both singular and nonsingular). The syn- 
thesis procedure is analytic and, therefore, no trial-and-error routines 
are necessary. Also, the state vectors are coded independently of the 
output table so that the coding process is able to treat systems that 
have linear or nonlinear, coded or uncoded, output vectors. Both the 
Moore and Mealy models are considered in deriving coding procedures 
for the input and output vectors. 


II. NECESSARY CONDITIONS FOR LINEARITY 


Forming the sum of two next states, say sj, and s;,, under the same 
present state, s;, yields 
Siz + Siy = Buu, +E Uy); 
since A(s; + s;) = AO = 0, mod (2). Since the sum is independent 
of the present state, it follows that 
Siz + Siy = Sez a Sey = °°: = Siz Sey = 2 OSS Syz + Syy 


for each x and y. Let the equality of these sums, for a particular x 
and y, be denoted by the term state sum, and call the individual sums 
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of pairs of next states component sums. For example, the state sum, 
S11 + S12 = Sa, + Soo = 831 + Se, consists of the component sums 811 


+ 812, S21 + See and S31 + S30. 

As a direct consequence of the state sum: if a present state has two 
identical next states, under two different inputs, then the columns 
which correspond to the inputs in question are identical, or the table 
represents a nonlinear machine.* 

The state sums of a linear system must be consistent over all pairs 
of inputs. For example, assume that a state sum contains component 
sums (written in terms of present state symbols) s; + se and s,; + 83. 
The state sum is consistent only if so = s3; however, if the output ta- 
ble does not allow the reduction of the state transition table by merg- 
ing sg and s3, then the state sum is inconsistent and the system is 


nonlinear. 
In order to check state sums over the entire table only M — 1 state 
sums are required. Taking the input w, as a reference, the state sums 


$11 T Sty St = Sa Sy = 8 = Sv + Syy 
for y = 2,3,..., M cover the table. Since, if sj, + sj, = si + si for 
all y of interest, then for any x 


S32 ae Sjy = Siz + $51 ae Siy + $71 
= Sic + Sia + Sig + Si 
= Biz TH Siy « 


Therefore, it is not necessary to form state sums for all possible pairs 
of inputs. For example, consider Table II. 


TABLE IT 
Uy, Ug Uz Us 
8; Se $8, 83 8% 
S82 8S, S83 8, S82 
Sy 85°. 18h Sa Sy 
S4 $: Se Ss 83 
From inputs w and we, 81 + S2 = 83 + s4 (redundant components sums 
have been deleted) is consistent in the first two columns. From wu, and 


* A similar result has been obtained by Davis and Brozozowski? using a different 
approach. 
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Ug, Se + 83 = 8; + Sg and from uw, and w4, Sz + S4 = 8; + 83. The last 
two state sums are rearrangements of the first and therefore, the state 
sums are consistent over the entire table. 

Another symmetry feature appears in singular linear machines 
(those characterized by a singular A matrix*). If A is singular, then 
for some present states the rows of next states are identical.? This 
follows since a singular A has rank r < n, and therefore, the null space 
of A has dimension n — r, so that As; + Bu, = siz has more than one 
solution for s; given s;, and U2. 

The reduced state transition table of a linear sequential machine 
has additional interesting properties. In what follows only reduced 
state transition tables are considered unless stated otherwise. 

As a preliminary, consider 


Theorem 1: If A is nonsingular, then the reduced table of a linear 
system has an even number of states. 


Proof: If A is nonsingular, under each input, each state will appear 
once, and only once as a next state. Thus, the next state columns are 
permutations of the present state column. As a consequence, each of 
the state sums involves all of the system’s states. If the number of 
states, VN, were odd, then the same state must appear in two distinct 
component sums of the same state sum. That is, the state sum con- 
tains an equality s; + s; = se + s; which implies s; = s,. But this 
contradicts the statement that the state table is reduced. 

Next, a starting result which connects the number of system states 
to the number of distinct inputs? is described by 


Theorem 2: For a reduced, nonsingular, linear sequential machine 
which has N states, the number of distinct inputs cannot exceed 2, if 
N/2 1s odd, or 


t 
2+N > 2" 


t=2 


where t is the smallest integer for which N/2' ts odd. 


Proof: Consider the state sums associated with the first two dis- 
tinct inputs, w# and Ue. 


Si 1 Siz = 8213 + 82 = 8 = 8 $82 = ee = Sy; + Syeo. (3) 


* Common terms from linear and abstract algebra which appear in this paper 
are treated in several texts; for example, see Birkhoff and MacLane‘ 

tue is said to be distinct from u, if and only if the columns of next states 
under we and wy are distinct. 
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Taking wu, with uz, a third distinct input, the following state sum is 
obtained: 


Sir Sir = Sox + Sar = °° = $, + 3: = +++ = Sy, + Syz - (4) 


Writing segments of equations 3 and 4 in terms of present state sym- 
bols as s; + 8; = 8S. + 8, = 8S + Sy =..., and & + Se = 8 + Sig = 


. , respectively, leads to the conclusion that state sum consistency 
requires that equation 4 contain a sum s; + s,. For if it did not, 
then locating the terms of equation 4, which contains s;, say s, + 81, 
leads to 8; + S- + Ss, + 8s; = 0. From equation 3 s; + s; + s, + 8. = 0. 
For compatibility, s; = s,;; so that equation 4 must contain s, + s; or 
contradict the reduction of the table. It then follows that the state 
sums over distinct pairs of inputs must be mutually derivable via 
component sums. 

Let the component sums obtained from the first state sum from 
Uz and us be denoted as follows: 


Si = Si + Siz, coe Ss = 8 + Sie, ses) Sy = Sy1 + Syo.- 


Since there are N’/2 distinct sums, let the symbols S,, 82, --- , Sy/2 
denote the distinct eomponene sums. Then equation 3 can be repre- 
sented by S; = 8S, = = Saja 

In view of the freind: a necessary condition for pes is thie 
all other state sums must be derived from the sums S,, --- , Sy,.. 
In generating new state sums the S;s are paired and the eres 
sums which are consistent with equation 3 are formed by transposing 
terms in the resulting equation. For example, pairing S; and S; can 
yield either of the two equations which do not appear in 3: 


Sir + Sj1 = Sj2 + Si2, 
or 
Sir + Sjo = Sy, + Sio . 


Then it is clear that each pairing of the Ss yields two possible state 
sums. Therefore, the number of unique pairings of the Ss, where each 
pairing occurs only once (this insures that no component sum will appear 
in two distinct state sums), is equal to half the maximum number of 
distinct inputs in excess of the first two. 

Pep ereUne the S; according to subscript parity gives: 


Sy S; vs - Syye- ~1 


S2 Si, er) Swi 
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If N/2 is odd, then one S cannot be paired. Therefore, one sum will 
occur in more than one state sum. Since this is inconsistent with a 
reduced linear table, a system for which N/2 is odd can have only two 
distinct inputs. If N/2 is even, then the number of unique odd to even 
subscript pairings is N/4. The odd-to-odd and even-to-even pairings 
can be enumerated by considering a single row. It is advantageous to 
transform the subscripts as follows: 


Si =8,, Sh = S.,--- , 82 = Ba. 
Then, separating the new symbols by subscript parity gives: 


St S3 he Syya-1 

S2 Sp--+ Sy. 
When N/4 is odd, no pairing is possible. If N/4 is even, the odd-even 
subscript pairings number NV/8. Clearly, the odd-to-even pairings of 


the S’ can be treated by reapplying the same transformation to the 
subscripts. Therefore, the number of allowed pairings is 


t 
Ne 
i=2 
where ¢ is the smallest integer for which N/2! is odd, if N/2 is even. 
Since each pairing provides for the generation of two distinct columns, 
in addition to the first two columns, the number of distinct inputs is 
2, if N /2 is odd or not greater than 


t 
2+N > 2'%, otherwise. 
i=2 


This completes the proof.* 

Theorem 1 leads to a very simple test for the identification of non- 
linear tables. The number of states in the table is used to determine 
the maximum number of distinct inputs. Then, the table is rejected 
as nonlinear if the number of its distinct inputs, or, equivalently, the 
number of distinct columns, exceeds the maximum. Table III illus- 
trates the restriction which linearity imposes upon the form of the 
state transition table. 

There are similar, but weaker restrictions associated with the state 
transition tables of singular linear machines. Consider a system which 
has N states such that each next state column contains d(<N) distinct 
states. (The singularity of the A matrix requires that some rows of 


* A smaller upper bound can be obtained when N + 2”. See the Appendix. 
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TABLE III 
Maximum Number of 

N Distinct Columns or Inputs 
4 4 
6 2 
8 8 
10 2 
12 8 
14 2 
16 16 
70 2 
72 56 
74 2 
76 40 
526 2 
528 464 
530 2 
2086 2 
2088 1568 
2090 2 
2092 1048 
2094. 2 
2096 1836 


state transition table to be identical.) If the singular matrix A has rank 
r, then the maximum number of present states which yields the same 
next state (that is, the maximum number of times a row can be re- 
peated) cannot exceed 2” °”. Since there are N(S2") states, N27” S 
d S 2’.r ¥ nimplies that each column cannot contain all of the system’s 
states so that the reasoning of the last theorem cannot be applied. 
In order to gain some insight into how linearity limits the form of the 
state transition table, consider the case where N = 2”. That is, the set 
of state vectors form a complete set of n-dimensional vectors with 
components over GF(2). Let S denote the set of present states, 
(8: , S82, °** » Sy), Siz is defined as the set of distinct next states, (s:. , 
Sor, ‘** » Saz) under the input u,, and it is assumed that u, = 0. 
Consider the following 


Theorem 8: A linear system which is associated with a singular A 
matrix of rank r, a null inpué vector, and which has all states appearing as 


SEQUENTIAL MACHINES 351 


next states must have at least 2"~" distinct input vectors, and S;, and Sj, are 
either identical or disjoint for all x and y. 


Proof: First, it will be shown that the S,, are cosets of the group 
{S, +}. Since S is a vector space, 0, s; + s; are members of S for any 
8; , 8; which belong to S. If u, = 0, then the vectors of S,, are a subspace 
of S because: 


(t) A(O) = Oe S8,, and 

(7) As; , As; ¢« S,, implies A(s; + s;) © Si 
(This follows since s; + s; = s, e S and As, ¢ Sy, .) 

The nonull input, u, , generates cosets of the group {S, +}, because 
Bu, is an n-component vector which must belong to S and therefore, 
Su + Bu, = Si. . 

It is well known that cosets are either disjoint or identical. Since 
Oe S,,, Bu, ¢ S,,; implies that S,, and S,, are disjoint. Therefore, no 
member of S,, can be used as Bu, if the table is to have a column which 
contains states not found in column 1. If S,, and S,, are to be disjoint, 
then Bu, « (S — 8,,); that is, Bu, can be selected from a set of 2” — d 
vectors. Continuing, the next distinct coset is associated with an input, 
u,, such that Bu, has not appeared in any of the preceding cosets. 
(If it has, then 0 e S,, + Bu, .) Then, Bu, is among 2” — 2d vectors. 
The last unique cosets is generated from a set of d vectors, or 2” — 
kd = d.So that there are k + 1 = 2"/d unique cosets of next states in the 
table. If each present state is to appear as a next state, the table which 
contains the minimum number of distinct columns must be comprised 
of one column from each unique coset. Since A has rank r, d = 2’; 
consequently, there must be 2””” distinct inputs. 

Also, since each unique coset can form 2” distinct columns, the num- 
ber of distinct inputs is not greater than 2”, as expected. 

This section has derived several properties that must be exhibited 
by the state transition table of a linear sequential machine. The con- 
sistent state sum requirement will play a central role in the code 
assignment problem. 


III. SYNTHESIS: THE ASSIGNMENT OF LINEAR STATE CODES 
To each symbolic state, s, it is necessary to assign a p-dimensional* 
vector, v, with components over GF (2). That is, sis > Vie. 
The vector assignment must preserve linearity ; 
v;, = Av; + Bu,. 


kn =n 


352 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1968 


Linear systems must give rise to consistent state sums; therefore, 
the vectors must obey the same sums. Since state sums are only neces- 
sary conditions for linearity, a nonlinear state transition function may 
exhibit consistent state sums. 

For any sequential machine the general state transition equation 
can be written as 


V2. = Av, + Bu, + fe ’ (5) 
where fix is a p-dimensional vector which is a nonlinear function of 
the present state and input vectors. When the symbolic states obey 
the state sums, (equation 3), the vectors must be assigned such that 


V2 a Dig = Vex + Uap SRS ES Oe “Pty Te ORS SU Ne a Oia (6, 
for each x and y. Therefore, from equation 5 it is clear that the non- 
linear function obeys the same restriction, 


fie + fiy = for + foy = tee =f2+ fi = ee = fy. + fx, - (7) 


The selection of the A and B matrices exerts some control over the 
nonlinear function. When uw; = 0, 


Alp, | vp | eee lve) + Ufa eee | at | for] = [on lay oe | v1] 
where 
[v, | v2 | +++ | ol 


is ap X p matrix whose columns are 7p linearly independent vectors.* 
Then, 


fu = fa = poet = fi = 0 (8) 
can be achieved by 
A= [v1 | Vai | woe | Vor lle: | V2 | ae | ele" 
Similarly, 
Alv, |v | Von | v,] + Blu, | tee | ses ieee 
+ is | Sia | eee le eee = [v2 res | ats Vacant 
where tig, ... , Un41 are m linearly independent input vectors, yields 
fie == fis ye tae fism+1 = 0 (9) 


*In cases where the system is singular the matrix of next states (the matrix 
on the right side of the last equality) must be selected so that A has the required 
rank. The rank of A can be determined directly from the repetition of rows in 
the transition table. 
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when 


B= [v2 + Uy | vs + Uy | ss oaere i V1 |[Ue | ws | aes yer ie 


Additional constraints on the nonlinear function become clear when 
equation 7 is examined in light of equations 8 and 9. For + = l,a 
sample equality in equation 7 is 


bet ie = et Jas 


When 7,7 S 7, fii = fir = 0 (by equation 8); so that f;, = fi, . Equa- 
tion 9 indicates that f;, = Ofory S m+ 1,j = 1. Therefore, f;, vanishes 
when? S pand y S m+ 1. Equation 7 implies f;, = f;, = Oandf;, = 
fiyforz,ySm+1,t18S p <j. Finally, f,, = f:2 + fy, when e S m+ 
1 < y, andj S p < i. Table IV below summarizes the restrictions on 
the nonlinear function for systems which exhibit consistent state sums. 


TABLE IV 
Uy U2 amar Um+1 Uns+e2 ware * Un 
Vy 0 0 rey 0 eee eee fia 
Uo 0 0 soe 0 constant oe constant 
Up 0 0 ree 0 Tai oP frar 
Vn+1 fot1.1 ones beets Jovia a fiim+e ae frets =r fiar 
Un fxs. ous fx fyi + fi,m+e eee fi r fist 


A particular code assignment can be verified by comparing state 
transitions along segments of one row and one column with the transi- 
tions predicted by the linear equation. If f;, and f,, are found to vanish 
forp <j Ss Nandm+1< 2S M, respectively, then the code as- 
signment is acceptable. The necessary state transition checks number 
M+ N — m — p — 1 (compared with MN — p — m — 1 checks if 
state sum consistency is not verified before a code assignment is at- 
tempted). The implication is that sufficiently large values of m and p 
will force the nonlinear function to vanish over the entire table. While 
it is undesirable to increase m and p (since this requires more memory 
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elements) it is certainly possible to do so in principle.* However, in 
computing the A matrix it was convenient to construct a nonsingular 
matrix of p linearly independent vectors. The state sums (equation 6) 
which contain no more than N/2 component sums imply that at most 
N/2 + 1 states can be assigned vectors independently. 

Therefore, not more than N/2 + 1 of the coded state vectors are 
linearly independent with the consequence that it is impossible to 
form a nonsingular matrix of coded state vectors when p > N/2 + 1. 
Where p exceeds this limit it is possible, in some cases, to express some 
elements of the A matrix in terms of the remaining elements. This 
method for finding A is far less attractive than forming a nonsingular 
matrix of coded state vectors; accordingly, the bound p S$ N/2 + 1 
will be enforced. The development which follows demonstrates that 
the limitation on p does not obscure a system’s linearity. Similar con- 
siderations lead to m < M. 

Turning to the state coding problem, the state sum will play an im- 
portant role in the generation of equations which lead to the linear 
coding of states. First, attention will be concentrated on nonsingular 
systems, then a later section will treat singular systems. 


3.1 Nonsingular Systems 
Consider the sum of two component sums 
Vix + Viy + Viz + Viy = 0; 
it is true that 


Avi; -+ Viy ++ Viz + Viy) 
= Vie + Vint + Vj21 Vii a fiz ss fiva ae fiz =F tiv 
= (0. 


(The additional subscript indicates that v,, 1s the present state which 
goes into Vig1 under the null input, w,.) Taking viz, viy, and vj, among 
the p independent vectors implies fie: = fiyy = Fier = = 0. Furthermore, 
assigning vectors such that 


Vint + Vint + U521 a Viy1 = 0 (10) 


forces fjy, to vanish. The sum (10) must be consistent with the state 
sums (that is, no state sum can contain an equality of component 
sums which contradicts equation 10). 


* When the input vectors are given as coded it is possible to increase their 
dimension by a translation. 
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By similar treatment of all component sums of a state sum, the 
nonlinear function can be made to vanish in the first column (and 
therefore, over the first m ++ 1 columns) provided that the attendant 
increase in the value of p (owing to the designation of independent 
vectors) does not cause it to exceed its bound. 

If all component sums are treated, it is possible to generate an 
equation of the type 10 in which all four of the vectors have been 
previously designated linearly independent. Clearly, the equation con- 
tradicts the independence of one of the vectors; then, any one of the 
vectors must be deleted from the set of linearly-independent vectors. 
The nonlinear function corresponding to the deleted vector can be 
made to vanish by satisfying the type 10 equation which is obtained 
when the A matrix operates on the generated equation in question. 
Consider the following process for treating a single state sum. 


(t) Select a component sum as a reference sum. Add another com- 
ponent sum to the reference sum. 

(«) Operate on the resulting sum with the A matrix to obtain an 
equation of the type in equation 10. Designate linearly-independent 
vectors as required and mark the vectors for which the associated 
nonlinear function has been forced to vanish. 

(wii) Verify that the equation obtained in step 7 is consistent with 
the state sum and the other equations obtained in w. If all vectors 
in the equation generated in iw are linearly-independent, delete one 
of the vectors from the set of linearly-independent vectors and repeat 
step 22 using the generated equation as the sum upon which A operates. 

(iv) If one of the type 10 equations has three linearly-independent 
vectors, use it as the sum in repeating step i. Otherwise, add another 
component sum to the reference sum and repeat step 7. Repeat w and 
a“ until an inconsistent equation is generated (the system is no- 
linear), or until all vectors have been used (the system is linear). 


The first time the process passes through step 72, three linearly- 
independent vectors are required; in subsequent passes at most one 
additional independent vector is needed. After the first pass through 
the process N/2 — 2 unused component sums remain. Therefore, not 
more than N/2 + 1 independent vectors are required for the process, 
precisely the upper bound on p. If the state vectors were coded using 
this process, the system would have an undesirably large number of 
memory elements. Therefore, this process is not an efficient design 
procedure. However, completion of the process implies linearity of the 
first m + 1 columns with the result that the A and B matrices can be 
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determined. Linearity of the table over the remaining columns is 
dependent upon the coding of the input vectors. 

As previously indicated, linearity over all of the columns can be 
attained, in the worst case, by increasing the dimension of the input 
vectors. In this case, m = M — 1 would yield a linear system. If it 
is assumed that the input vectors are uncoded, or can be recoded, 
then it follows that completion of the process is a sufficient indication 
that the system is linear. (The problem of coding input vectors is 
treated in a later section.) The process will be referred to as the 
maximum memory process. 

In order to illustrate the maximum memory process consider the 
reduced table, Table V. 


TABLE V 

0 1 
8; 8; S6 
8» 8 Ss 
83 85 S3 
84 S2 84 
85 Ss 87 
S65 84 So 
Sy Sq 8 
Ss 83 85 


In terms of the coded vectors the state sum is 


V1, + Ue = V7 + Vg = V3 + Us = Vo + U4 (11) 


(which is consistent over the table). Using v1; + v¢ as the reference 
sum, 


A(Qv, + 06 + v7 + dg) 
= fi.o + Io.o ze fr.o + fs.o + V1 + V4 + V6 + U3 = 0. 


Designating v1, Vs, and vz as linearly-independent vectors and satisfy- 
ing the equation 


Vv, + U4 + 0g + v3 = 0 (12) 
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fr.o a fo.o = fr.0 > fs.o = 0. 
However, vectors v1, vg, and vy appear in two component sums of 
equation 11, the first and last; this implies 


0; + 06 + Vg + vg = 0. 
Adding this to equation 12 leads to ve = v3. This contradicts the re- 
duction of the table; therefore, equation 12 is inconsistent with the 
state sum. The system is nonlinear. 

As pointed out, the maximum memory process is not a suitable 
vehicle for the economical design of linear sequential machines. The 
process extracts a limited amount of information from the state 
transition table. This can be improved by considering all of the unique 
state sums (that is, from the M — 1 state sums by pairing the first 
input with every other input), and using such equations as 10, which 
the process generates, to better advantage. 

Consider the following procedure: 


(2) Form the M — 1 unique state sums. 

(it) Select a reference sum from one of the state sums. Add another 
component sum (from the same state sum) to the reference sum. 

(iit) Operate on the sum with:the A matrix. Designate linearly-in- 
dependent vectors as required. Obtain an equation like equation 10 
and verify that it is consistent with the state sums. Mark the vectors 
in all state sums and equations of this type which have been guaran- 
teed a linear state transition under the null input by this step. If all 
vectors in the equation obtained have been designated linearly in- 
dependent, delete any one of these vectors from the set of linearly- 
independent vectors and repeat this step using the equation as the sum 
upon which A operates. 

(iv) In the state sums where at least one component sum has had 
both vectors marked in step 7, search for a component sum which has 
one vector marked or, search over the type 10 equations for one 
which has three terms marked. If such a component sum or equation 
is found, use it in repeating step i. (Since three of the vectors make 
a linear transition, the nonlinear function which is associated with 
the fourth vector can be made to vanish in step 2 without designating 
another linearly-independent vector.) Otherwise, 

(v) If the sum of the type 10 equation is unique and has two vec- 
tors marked, then use it in repeating step 122. Otherwise, 
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(vz) Form a new sum for use in step 2 by adding the reference sum 
to another component sum (from the same state sum). Repeat step 217. 


The process is repeated until all of the vectors have been marked 
in step v2 (the system is linear) or until an inconsistent equation is 
generated in step 277 (the system is nonlinear). The coding process for 
the state vectors is initiated by assigning arbitrary, but linearly in- 
dependent, vectors to the state vectors so designated by passes through 
step wt. The remaining state vectors are coded using the type 10 equa- 
tions which were generated in step 12 in conjunction with the state 
sums, 

The application of this synthesis procedure is more straightforward 
than its description would indicate. This is best illustrated by an 
example. Consider Table VI. 


TaBir VI 

GG) @ 

0 0 1 
8; S6 S4 S2 
So Si 83 Ss 
Ss Se Sy Ss 
S4 83 81 Ss 
S85 S4 S65 87 
86 85 Ss 83 
8; Ss 85 81 
Ss S7 So Sa 
Sp S11 Sie S»9 
S10 So S10 Sir 
811 S10 S9 Sie 
Sie S12 Si1 Sio0 


In terms of the coded vectors, the state sums are: from inputs 6) and 


(i) 


Ve + Us = V, + V3 = Ve + V7 = Us + Vg = Vir + V2 = Vo + Mo ; (13) 
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’ 0 2 
from inputs (°) and 1)? 


Ue + Vg = 0, + Ug = Vs + Us = V4 + V7 = Vir + Vg = Vio + Ye - (14) 
Equations 13 and 14 are mutually consistent, and both are consistent 
over the table. 

The observation that Avi2 = V1 leads to the assignment of the null 
vector* to vie (therefore, fi29 = 0). Then in step i of the synthesis 
process, take v1, + v yg of equation 13 as the reference sum, and add 
it to Us + vg of equation 13. In step 2 form 


AQ, + 011 + U5 + 2s) 
= fie.o + fis.o + fs,0 + fs.o + Viz + V10 + % SO ae 


Since fioo = 0, taking v3, and vs; among the linearly-independent vec- 
tors and the satisfying equation 


Vig + Myo +g + 07, = 0 (15) 


leads to fi1.0 = fs, = fso = 0. Also, the set of vectors for which the 
nonlinear function vanishes, denoted by JL, is 


L= {Vie 9 %11, Us; vs}. 


Equation 15 is the sum of two component sums of equation 14. There- 
fore, equation 15 is automatically satisfied. 

At this point, component sums do not satisfy the conditions of step 
w. In step v equation 15 is not unique. In step vi taking ve + v7 with 
the reference sum yields 


A(z +, + ve + V7) 
- fie.o be fis.o =F fe,o rs frz.0 =i Vi2 ++ Vi0 + V1 + Us. 


Since fio = fir = 0, designate v2 as a linearly-independent vector. 
Then, 


fe.o = fr.0 = 0 
if 
Vio + io tv, + vg = O. 
The last equation is a rearrangement of terms in state sum (14) and 
therefore it is automatically satisfied. 
* The assignment of the null vector is somewhat arbitrary. It has been shown 


(Yau and Wang’) that the null vector can be assigned to any state which is 
mapped into itself under the null input. 
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Updating the set L, 


L= {Oia Vit» Vs y U7 y Us 5 Vo}. 


The conditions of steps iv and v are not satisfied; in step vi, adding 
V, + vz and the reference sum yields 


A(QYyo + 4, + 1 + 03) 
= fie,o + firjo + fi,o + fa,0 + 12 + Vio + U6 + U2 - 


Since fie,0 and fi: have been shown to vanish, including v1 among 
the linearly-independent vectors leads to 


fi.o = fs,o = 0 


Vie + M0 + V6 + ve = 0. 
The last equation is the sum of two component sums of equation 14. 
The set L becomes 


L= {Vie y Vir» Vs y U7 y U5 Us y V2, Yi}. 


In equation 14, vy; + vg, and vg + vs have both vectors marked in 
step 72 while vg + vo, Vs + V7, V11 + Vo and Vio + Vie each have one 
vector marked. 

In step zv, forming 


AQ, +g +056 +02) = fiso + fe.o + foo + feo + Us +0; +s +r . 
Since fio, feo, and fg have been shown to vanish, foo = 0 if 
Ve + v7 + v5; +», = 0. (16) 
Equation 16 is unique and consistent with the state sums. 
DL = {tye , 011» Us » Vz y Us» US y U3 y Vo, Vi}. 


Proceeding more quickly, A operating on v1 + vg + vg + v7 (from 
step iv) yields fo = 0 and 


Ve + V7 + 03 + vg = 0. (17) 


The last equation is unique and consistent with the state sums. Up- 
date the set L; let A operate on v1 + vg + V11 + Vo (from step iv) to 
obtain foo = 0 and 


Ve + 07 + V0 + Wn = O (18) 
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Equation 18 is unique and consistent and also has three vectors in the 
set L. Update the set L. 

From step iv, A(v6 + vz + Vio + Vir) yields fio = O and vs + Us 
+ v9 + Vio = 0 (which is the sum of two component sums of equation 
14). Update the set L. Step iv gives v1. + Vie + Vo + Vio Which, when 
operated on by A, yields 


fio,o = O and Vio + M2 +11 + = 0 


(which is the sum of two component sums). L contains all of the 
state vectors; therefore, the system can be coded. 

There are four linearly-independent vectors (p = n = 4). Make 
the following assignment of the linearly-independent vectors: 


rH OOO 


Since v;. = , all of the component sums in equation 18 equal 


ooo eo 


V1) +2 = 


ooo rf 


Then, from equation 13 it follows that 


1 1] 
0 1 
Ys =U lol = lol» 
0 0 
1 a 
0 0 
= + ol = fy 
0 0 


and 


ae es 
: Ne 


re OOF 
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Similarly, the component sums in equation 14 are each equal to vy + 


] 
oS ; with the result 

1 

1 1 

1 ] 

U6 = V2 + 0 = 1 

1 J 

1 0 

1 1 

U4 = v7 + 0 = 1 

1 1 

1 0 

1 1 

Vg =, + 0 = 0 

1 1 

1 1 

1 1 

Vio = U2 + 0 = 0 

1 1 


To calculate A consider 


Alvi | Us | Ve | v4] [v0 | v4 |v, | vel 


1000 tO, A 
Ors 0) ON ex ge 2 AO: ay. 
001 0 0101 
0001 pis. si Ss 


Bl | = [vs + Us | ve + U6], 
a | 
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with the result 


1 1 
pal? } 
0 0 
0 1 


Since equation 16 and 18 are consistent with the state sums and 
since they were not used in the coding process, they are redundant. 
It is easy to verify that the equations are satisfied by the code 
assignment. 

The process produced a code assignment for which the coded state 
vectors had the minimum number of components. This will not be 
true for all tables. Considering the way the process extracts infor- 
mation from the table leads to the conclusion that the attainment 
of the minimum component coding vector depends upon the con- 
nectivity of the sequential machine. In a case where the machine is 
not strongly connected there is a possibility the process will require 
—p > n.* (The last example involved a machine which was not 
strongly connected.) In order to illustrate this consider Table VII. 


TABLE VII 
0 1 0 1 
81 S6 S4 S89 S11 Si2 
82 81 83 Sio S89 Sio 
83 So 87 Sit Sio S89 
84 83 8; Sie Sie $11 
S5 84 Se 813 S14 815 
86 85 83 Si4 Sis Si4 
87 S83 S5 Sis 813 S16 
Sg 87 S82 S16 $16 $13 


The coded vectors obey the state sum: 
Us + Ug = 0, + 03 = V2 + U7 = Us + Us 
Vir 1 V2 = Vo + V0 = Vig + V6 = Via + M5 - (19) 


*A sequential machine is said to be strongly connected if it is possible to reach 
any state of the system starting in any initial state. 
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Taking vig = 0 leads to the seection of vig + vig as the reference 
sum since it is an advantage to exploit the fact that fig9 = 0. For 
brevity, the results obtained from step wi of the synthesis process are 
shown in Table VIII. 


Tape VIII 
Additions to 
The Set Linearly 
The Sum A Operates The Equation L Inde- 
Upon Obtained pendent 


Vs +3 +g t6 6 +07 +d, +H, = 0 Vie 5 Vis > Vg ,Us Us 4 Vig 


Viet 07 ++ 6 +vg +03 +215 = 0 Via » V7 U4 U7, Uy4 


Vie Fs TU ty re + +03 +0; = 0 V6 
Vie + 3 + 4 + 45 Vie + is + 13 + 5 = 0 Vis 
Ye Fig $07 +2 V6 +4 +, + 0s = 0 Va 
Vo Fs FU +s ie +5 +6 +0, = 0 Vy 
Vie FM3 +My +03 Vie + V4 +6 + 2 = O V3 


At this point it is observed that v9, vio, Vi1, and Vy. are not in L, 
and more importantly, it is not possible to involve these vectors in a 
relationship by application of step 7 without introducing another 
linearly-independent vector. By continuing the process it can be 
demonstrated that the system is linear. 


A (046 + m3 +o, + Vio) yields tii.o = fie.0 

when 

Vie + U4 + V0 + M2 = O. (20) 
Then, A(vig + Via + Vio + Vie) leads to fio = fie,0 
when 

Vig + 5 + Vie + Vg = 0. (21) 
A, operating on the last equation, gives foo = fiz,o 
when 


Vie + Us + V2 + 1, = 0 (automatically satisfied). (22) 
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The system is linear since designating vy. as a linearly-independent 
vector leads to 


fieso = foro = fiovo = firyo = O. 


It is also of interest that the set Z which was obtained by the process 
can be coded using the four linearly-independent vectors. This will 
be true in general. 

In order to insure realization of the transition table with the least 
number of memory elements, it is important to develop a means for 
keeping p at its minimum value, n. First, a general method for the 
reduction of the number of vector components will be developed. Then, 
the method will be applied to the problem at hand. 

Let the set Z, denote the largest set L generated by the synthesis 
process using 7 linearly independent vectors; let L,, denote the set of 
vectors which require additional linearly independent vectors in order 
to become members of L. The n-component vectors y are members of 
L, , and the n-component vectors 7 are in L, . 

The members of LZ, can be coded. Taking the first n vectors of L, as 
linearly independent (since order is unimportant), the calculation of A, 
after the coding, leads to 


Aly; | Yo | — | Yn = [vio | Yeo | coe | Gals 
This can be simplified by coding the linearly-independent vectors such 


that [y, | yo | --- | yal = I, (the n X n identity matrix). Then, 


A = [y1o | Yeo | +++ | Ynol- 


Suppose the set Z,,, is tentatively formed by designating another 
linearly-independent vector. It is clear that the vectors in L,,, (and 
Ln+1) have n + 1 components. In order to preserve the coding of L, 


take (¥) e Lz+, where y e L,. That is, the vectors which have been 


coded over 7 linearly-independent vectors are increased by one com- 
ponent (which is taken as zero. Members of L, which become members 


of Dns, will be denoted as (7), Let (2-01) denote the (n + 1) * linearly- 


independent vector where #1 is any coded vector not in L,. The 

(n + 1) X (n + 1) matrix A is given by 

at Yo Un Gust | = Yoo Yno ea) : 
0/0 0 1 | 0 0 0 1 
* If jnz1 and Jn41,0 are in different sets (Ln or L,), then the known A matrix can be 

used to determine the coding of the vector which is in LZ, . 
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From the previous coding of L, , it follows that the left side of the last 
equation can be written as 











A de fs | 
al ees 
Therefore, it can be verified that 
A — AL AGns1 ae Passe 
A= ; 0 i (23) 
Observe that if A has the form 
= A!l0 
i=[4/9], (24) 


then A operating on any vector which has the form (7) will yield a 
vector of the same form. Similarly, all vectors (¥) are mapped into 


another vector where the last component is zero. Since all of the n 
component vectors y (of Z,) and all n component ¥ vectors have different 
codes, then the last component of the vectors in L,,, can be deleted. 
Also, the matrix A (in equation 23) is the required matrix. 

In view of the foregoing, a code transformation, acting on the coded 
gj, must be found such that A has the form of equation 24. It is well 
known (for example, Cohn and Even’) that the code transformation 
y’ = Ry, where F is a nonsingular matrix, cannot alter the linearity 
of a system. Irom the state transition equation 1 it is easy to show that 
this type of code transformation produces a new matrix of the form 
RAR". 

It is required to find an & such that 


Tp-1 _ | A}0 
RAR - [4/21 (25) 


Comparison of equations 23 and 24 indicates that R must have the form 


I, | T 
R= | #1. (26) 


Using equation 26 in equation 25 leads to the following restriction 
on the vector 7: 





AGn+1 a Yn+1.0 = (A a [,) T. (27) 
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Also, applying the coding transformation yields 


els) (9 
et) = (047) 


y + T cannot be a member of L, if the (n + 1)** component is to be 


deleted. 
Therefore, the process of reducing the vector components by one is: 


and 


(t) Determine the vectors 7’ which satisfy (27). 
(72) Of the vectors obtained in 7, select one which preserves the 
identity of L,,. 


Returning to the example, it can be verified that the following is an 
acceptable code for all of the states except v» , V10 , Yi1 » ANd V2: 


1 0 1 0 1 
pe 1 ite _ 10 _ |0 ae 0 
GL, RS: OR gpa Egat eG 
1 0 1 1 0 
0 0 1 0 
_ {1 _ 10 _ fil =i! 
a Se ala % = Jol? %3 = |g 
1 0 0 0 
0 0 0 
_ |0 2.i(h aa _ |9]. 
"nO OQ} ? V5 = ol? a Y%X6 = 0 
1 1 0 
The corresponding A matrix is 
0010 
pe: 001 1) 
100 0 
1101 


The set L, is {v9 , Vio, V1» M12}. Set v equal to any vector not in L,, 
for example 
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From the table v9,9 = Vi1. To determine v4,, add equations 21 and 
22. The result is 


1 
0 
Vi. = V9 HU5 +3 = Eh 
1 
Then, 
001 0/f1 1 0 
00 1 14/0 0 1 
Ady + V9,0 = + = 7 
1 0 0 O}f1 1 
1 1 0 1)\0 1 0 
and 
101 0 
es 011 1} 
101 0 
1100 
Substituting in equation 27 yields 
1 0 1 0 0 
Oe ae ae ea 
101 0 
110 0 0 





An acceptable solution is 


Fe Oc © 
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1 0 1 
: 0 0 oO; . 
since the new v,, 1 + eect ne is not a member of L,. From 
0 1 1 
equations 20 through 22, 
1 1 1 1 
ly = 0 atts a0 and sibs 
Vg = | ) “o> 1 ) Uv", 1 ) V2 = 1 
1 1 0 0 


It can be verified that this is an acceptable coding. 


3.2 Singular Systems 


The synthesis procedure which was developed in the last section 
can be readily extended to treat the coding of singular systems. 

Consider a state transition table that has d (d < N) distinct rows. 
For linear systems, tables of this type have columns of next states 
comprised of sets of d distinct entries. Therefore, the state sums 
(with respect to the first column) have the form 


Vi tye = tt = Vi He = tt = Va Ha, 
LOPS By. is IVE, (28) 


In general a given table can have columns that are disjoint sets of 
states. Also, when 2d < N, these columns can generate state sums such 
that the only state vectors two state sums may have in common are 
from the first column. State sums of this type will be called distinct 
state sums. 

An attempt to apply the synthesis algorithm of the last section 
to a table which has distinct state sums can lead to undesirable 
results. Since the equations determined in step 2 of the algorithm 
can contain only the d distinct vectors of the first column, and since 
all state vectors must be guaranteed a linear transition to column 
one, it is clear that in step w at least one linearly-independent vector 
must be designated for each distinct state sum. As a consequence, a 
system coding that requires a large number of components may re- 
sult. In order to avoid this situation it is useful to introduce the 
concept of independent inputs. 

The set of independent inputs is the set of input vectors that spans 
all of the input vectors. That is, if we, ..., Wnai are the independent 
inputs (the set of linearly-independent input vectors), then any other 
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vector, say u,, can be written as 
m+1 
ig Dy Ope, 


z=2 
where the a,, are constants which can be arbitrarily assigned when 
the inputs are uncoded, or they can be determined by inspection of 
uy when the inputs are coded. 
Recall that Buy = viy + va, for any 7; allowing B to operate on 
the expansion for wu, yields 


m+1 


Viy a vn = » Ayz2(03(2)2 a V5¢2)1)) (29) 
xr=2 


where each j(%) can assume any row index. 

The state vector vi, can be forced to make a linear transition under 
the null input if all vectors in column one and all of the v;;.)x have 
been guaranteed a linear transition in step 2, and if (29) holds. To 
demonstrate this, use the A matrix to operate on the sum of vy + viz 
and another component sum; then, compare the result to that obtained 
by repeating the operation after vj, + va has been replaced by its 
expansion. The equations of the type (10) which are generated are 
required to meet the conditions of step i. Since each distinct state 
sum not associated with an independent input can be treated in this 
way, these state sums meet the conditions of step iv without the desig- 
nation of a linearly-independent state vector. Also, since all vectors 
in column one make linear transitions, all other vectors in the state 
sum can be treated via step iv. 

After the independent inputs have been identified (for coded inputs) , 
or designated (for uncoded inputs) the synthesis procedure of the last 
section*® can be applied over the state sums associated with inde- 
pendent inputs. The remaining vectors could be treated by employing 
equations of the type (29) and entering the synthesis process at 
step ww. 

A modified synthesis procedure of this nature would require con- 
sideration of the equations of the type (29) in the coding process. 
The modifications are compatible with nonsingular systems. 

The remainder of this section develops an alternative synthesis 
procedure that is better suited to capitalize on the redundancies found 
in transition tables of singular systems. 

*It is necessary to provide for the selection of another distinct state sum 
after all component sums of the state sum under consideration have been ex- 


hausted. This can be accomplished by selecting the reference sum in step vi from 
another state sum. 
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When, for example, columns 1 and 2 are disjoint sets of states, 
(28) contains 2d distinct vectors. It is asserted that the 2d vectors 
of this distinct state sum can be coded over the r + 1 components 
(where r is the rank of the A matrix) by a modified form of syn- 
thesis procedure of the last section and that the remaining vectors 
can be coded by a simple relationship. Since only one distinct state 
sum is to be considered, the information concerning the designation 
and deletion of linearly-independent vectors (in step 22) which is 
implicit in the other state sums must be incorporated. This is readily 
accomplished by generating equations of the type (10) over the re- 
maining state sums and equations like (29). Then, the unique equa- 
tions, which must be compatible with state sums, are used to augment 
the state sum; that is, these equations, as well as the state sum, are 
used as state sums in coding over r + 1 components. (Notice that the 
equations in question contain only vectors from column one.) 

Partition the vectors such that the upper partition contains r + 1 


components. That is, 
a [2 | 
a tt 
Viz 


Then, using the augmenting equations, 
Vir Hig = tts SU Hg Ss = Va + Vie 


and the state sums formed by columns which may appear in the table 
that are permutations of the first or second columns in the synthesis 
process, will yield a coding of these vectors and an (r+1)X(r+1) A 
matrix. Let A’ denote this matrix of rank r. The relations 


N2" sas 2° 
(from Section 11) and 
NO 
imply 
oO 200-2 2" (30) 


which verifies that 7 + 1 components are required to code the vectors 
in the first two columns. 

Before continuing, it is convenient to separate the present states 
into sets such that all members of a particular set give rise to identical 
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rows of next states. Let V; denote the set of present states which are 
associated with the j* row. If one or more members of the set has been 
coded over r + 1 components, then select one such r -- 1 component 
vector as the characteristic vector of the set. Denote it by the symbol 
cv}. If no vector in V; has been coded over r + 1 components, the 
characteristic vector can be determined from 


Taryyth — , 
A’cvt! = vi. 


This last equation has at least one solution for cv/ since the columns of A’ 
must span all of the v’ vectors of the first column. Also, the cv} so deter- 
mined is not a characteristic vector of any other set. Then, taking the 
system A matrix (n X n) as 

i 

0 


A’ 

a=|4 

will make the linearity of the transitions in the first column depend- 
ent only upon the coding over v’. For the vectors which have been 


coded over v’, set the remaining components, the v”, equal to the 
n—r—1 component null vector. (This is necessary, if uw, = 0.) That is, 








vile ph = 0 for t= 1,2,---,d. 


Thus, all vectors in the first two columns are completely coded. 

The remaining vectors can be coded by the following process. Con- 
sider the column of uncoded vectors under an independent input u,. 
From the state sum of w,; and w,, it follows that 


Vy = On + My, + ON, for j = 2,3,---,4. (31) 


Since v,, and v;, are known, once v,, is coded, the coding of all other 
vectors in the column is determined by equation 31. v{, can be set equal 
to the characteristic vector of the set V in which v,, (as present state) 
belongs. This will insure that the present state v,, makes a linear transi- 
tion under input u, . v{/ can be set equal to any n — r — 1 component 
vector that has not been previously used as a v” vector. When uw, is 
not an independent input, v;, can be obtained from equation 29 after 
the independent inputs have been treated; then equation 31 gives the 
the coding of the remaining vectors in column y. 
To illustrate the process consider Table IX. 
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TABLE IX 

0 1 0 0 0 

0 0 1 1 0 

0 0 0 1 ] 
8, Sy S3 Ss 87 9 
Se So S4 86 Ss S10 
83 S$} 83 Ss 87 Sg 
S4 S82 S83 S85 Ss Sio 
85 8) 83 85 87 So 
So So Ss So Ss Sio 
87 8; 83 Ss S7 Sy 
83 S2 Sy 86 83 Sio 
S89 S$; 83 S5 87 So 
S10 So Sy Se S3 S10 


The state sums are 


Vv, + =e +, 
Vv», tvs = ve +4, 
V, + 07 = Ve + dg, 
V,) + V9 = v2 +0. 


The state sums are consistent over the table. Take the second, third, 
and fourth inputs as independent. All state sums and equations like 
(29) generate augmenting equations identical to the null vector. To 
determine 7, note d = 2; therefore, from equation 30, r = 1. The vec- 
tors in the first state sum are coded over two components using the 
synthesis process. The result is 


iQ. #-Q. «O48 
m= \o/? ve = Vo) > a oo May 3 


and 
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The sets V are: 

Veg SOs. 0s (Rats 

Vo = {va >a ; Ve Us 5 Vi0} - 


The characteristic vectors are taken as vf and vj , respectively; that is, 


cu, = @ and cv, = (*). 


The vectors in the first two columns are fully coded by setting the 
last two components of each vector to zero: 





0 1} 0 1 
_ {0 _ |0 fi AA 
Ya A? va = lol? U8 Lala Ya = lol 
LO 0 0 
and 
100 0] 
ve 00 0 
00 0 
000 0 


From the second state sum, 


Vs = 0; + Vo + Ue. 
Take 


Since vg « Vo, 


therefore, 


SEQUENTIAL MACHINES 375 


and 


vB, = 


oCcor 
or Oo O&O 


Similarly, taking 


and observing that 


ve = cv, and vz, =v, +e + ds 
lead to 


and v, = ; 


HOT, 


0 
0 
1 
1 1 


respectively. Since the last input is the sum of the third and fourth 
inputs, vy + V9 = Vy + V5 + V1 + V7. Then, 


1 
0 
: and vy = 0 
il es 
Finally, 
0 0 0 
B= 1 0 0 
0 1 0 
10 O 1 


IV. CODING THE OUTPUT VECTORS 


4.1 The Mealy Model 


Let z;, denote the symbolic output vector for input uw, and present 
state s; . Let the output table contain ZL distinct vectors; then, z,, is an 
l-component vector (L < 2’) over GF(2). A k-dimensional vector 
(i S k), wz , is to be assigned to each symbolic output vector. 
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If the output is linear, 


Wiz = Cv; + Du,. (82) 


This equation implies that all of the equations obtained in terms 
of the coded state vectors (state sums and equations derived in the 
coding process) can be converted directly into relationships involving 
the coded output vectors. For example, if vy+ve = vgtv4, then from 
the output table, for any w,, 


Wy + Wey = Wy + Way - 


Let the equations obtained in this way be referred to as state derived 
equations. 

From equation 32 it follows (by the same reasoning leading up 
to equation 6) that 


Wie bt Wy =? = Wig + Wiy = 2+ = Wye + Wy - (33) 


Such equalities will be denoted by the term output sum. 

Clearly, output sums, and state derived equations must be con- 
sistent if the output is linear. 

It is known that N = 2” for a reduced table of a fully linear sys- 
tem.* Therefore, if N < 2" for a reduced table, the output is non- 
linear. On the other hand, where NV = 2”, the set of coded state vectors 
forms a complete set of n component vectors. Then, when ! > n and 
there is a null input, say w,, it is easy to show that the output vectors 
in column 1 are a subspace of the space of all output vectors. This sug- 
gests that the output vectors in column 1 can be coded over n compo- 
nents setting the remaining 1 — n components to zero (that is, code 
over the subspace only). 

The upper submatrix of the partitioned C matrix, 





| a le. isann X n matrix) 
l—n 


can be determined by considering the outputs associated with the 
independent coded state vectors. Then, 


Calas | +> | on) = [wi | +> | weal 


where the w’ are n component vectors. Recalling that [v; |... | vn] 
= I,, (for convenience in the state coding process) leads to 


* Cohn and Even’ 
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Since the subspace property of the first column must be preserved, 
it is true that C)_, is the n X 1 — n null matrix. Furthermore, C,, and 
the code assignment over n components can be determined by designat- 
ing ” independent output vectors in conjunction with the state derived 
equations and outputs sums. 

For all columns which are premutations of the first column, the 
_corresponding Du, must have zero as components | — n through 1 
(again, to preserve the subspace property). More generally, the 
entries in the first n components of Du, generate permutations of the 
first column, while the entries in the remaining | — n components force 
the output vector out of the n dimension subspace.* 

In order to code the output vectors which are not members of the 
first column, notice that an output sum which involves such a column 
and the first column has N distinct sums and 2N vectors. N of these 
vectors have been coded (the members of the first column) and one 
of these vectors appears in each sum of vectors in the output sum. 
Therefore, if a linearly-independent vector is designated, by assigning 
& nonzero entry in 1 — n components, then all of the remaining NV — 1 
vector codes are determined. 

For example, if, in equation 33, x = 1, then wiz, fori = 1, 2,..., 
N are vectors in the subspace (and can be coded), then setting w4, 
equal to a linearly-independent vector or any vector not in the sub- 
space gives the code assignment for all other vectors in column y 
since Wiy = Wiy + Wiy + Wiz for any 2. The procedure is basically the 
same as in coding state vectors of a singular system. 

The case where L < N can be treated as an obvious special case of 
the above. 


4.2 The Moore Model 


In the Moore model of a sequential machine the output is a func- 
tion of the system’s state alone. That is, D = 0. It is then obvious 
from equation 32 that the number of distinct output vectors cannot 
exceed the number of state vectors. Therefore, the method for coding 
over the n-dimensional subspace introduced in Section 4.1 can be 
applied directly, using the state-derived equations. 


* This property is the identity or disjointness of sets of vectors which can be 
rigorously proven by an argument parallel to the proof of theorem 3 
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V. CODING THE INPUT VECTORS 


5.1 The Mealy Model 


In the Mealy model of a sequential machine it is possible to have 
inputs which do not generate distinct columns in the state transition 
table and yet give rise to distinct output columns. Such cases require 
an interaction in the determination of the B and D matrices. 

Let there be M different inputs, and K(K S 2") distinct columns 
in the state transition table. Take one input as the null input, say, 
u, = 0. Considering one component sum from each state sum, it follows 
that 


Bu, = ty + 4, 


for each u, that generates a distinct column. For convenience number 
such inputs uw, through ux. Similar to the approach of Section 4.1, 
code the input vectors over the first k components. That is, from 


B = [B, | Ba-s] (where B, has k columns) 


select 

B, = [011 + M1 | as | vay + 01,441] [U2 | viet [eal (34) 
where the wu’ are linearly independent vectors formed by the first k 
components of the input vectors, and where v4, + Vi2,..., V1.1 + 


V1,n41 are the k vectors that span the set of the K distinct sums 
of the form v4, + v2. Since it is only the first k components of the 
input vectors that influence the state transitions, it must be true that 
Bm—z is the n X (m — k) null matrix. The remaining k component in- 
put vectors can be obtained by solving the set of linear equations* (in 
matrix form) 


Bul = U1 4- Viy 


At this point & components of all input vectors have been coded. 
Considering the output table, the D matrix can be determined from 


Du, | ee | u,] = (wi + Wie [wi + Wi | a | wi + wi] (35) 
where the columns of the matrix on the right side span all sums of 
the form wy, + Wi, and where tw, ... , U, are m inputs which are 
assigned as linearity independent vectors. 


* These linear equations must be consistent since the columns of By span all 
vectors of the form vu + viy. (See equation 34.) 
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Consider the example in Table X. 


TABLE X 
U, ve Us Us Us Us Ur Us 
Si S17 2a Sul ae. Bile. Si lee. <Sx/2s- “Sales. Ss/25. S3/2ig 
Ne. alee Bate Seleg Sales. 8ifee- Siler. Biren. “SG 20 
Ss Sale: Sefei: Bafaig, Se /2ib: Sal zig, Bale Sal ene Se) 16 
Se SIGs Slee. Galois: alin Sifeie.. Silzia Sifeiw. Sia 


In order to code the system states, consider vw, and ug which generate 
the only distinct columns. Take uw, = 0. It can be verified that an 
acceptable coding is 


ee ee 
“ns 


For the output table, L = 16, 2 = 4. Calculating the upper partition 
of the C matrix, Ce, 


giving 


C.[v3 | v2] = [w3 | wa], 
or 
C, = [w3 | wé]. 


Take 


then, C, = I, or 


oclUlUCcOWlUWOlhlUM 
Co oF hl O 
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From the state sum, wi = (*), and since v, = 0, it follows that w/ = a 


Clearly, w, , w., w3, and w, are given by including two additional zero 
components. In order to code column 3 consider the output sum 


Wy, + Ws = We + W, = W3 + Wii = Wh + Wien. 


0 
Taking w; = : leads to 
0 
0 0 0 0 
0 0) 1 1 
Dg DE Wg a Wa rE Cae | ah ha | 
0 0 0 0 
Similarly, 
1 1 
_ |0 _ jl]. 
Wi = 1? Wi2 = 1 
0 0 
By taking 
0 0 
0 0 
Wr = |g and wy = 1 
1 1 


the remaining output vectors can be coded. 
Coding the first component of the input leads to 


[4 
LO 
With the results 


pik oe Oo] 
LO 0 0; 
Uy = wm =U =u =0, (36) 
and 
u=uy =u =u=1 (37) 
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The output vectors w. , w; , and w; span all sums of the form w; + w, 
(notice w, = 0) x = 2, 5, 6, 7, 8, 9, 10. Then, 


D{uz | ws | us] = [we | ws | w7] = 


Co FF © 


0 
0 
1 
0 


el > > >) 


0 


Considering equations 36 and 37, make a linearly-independent assign- 
ment of u., us, and u;. Say 











(0 0 (1 
w= 0 ; uz; = j1|, and u;, = |0| ; 
1 0 0 
which leads to | 
0 0 0 
D= 0 0 1). 
0 1 0 
10 0 


To determine ws, for example, set up the linear equations 





Diy = 264 (recall uf = 0) 
‘0 0 0 ) 
0 0 1 1 
Uy = ; 
0 1 0 ] 
il 0 O 0 
or 
fo) 
Uy = 1) 
1) 
Similarly, 
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5.2 The Moore Model 


For the Moore model all columns of the state transition table are 
distinct; therefore, the method for coding over k components of 1, 
introduced in the last section, can be applied directly. 


VI. LINEARITY AND INCOMPLETELY SPECIFIED SYSTEMS 


This section considers the problem of specifying “don’t care” entries 
in the state transition table in a way consistent with linearity; the 
results can be extended to the output table. 

It is obvious that if an incompletely specified table is to have a 
linear realization, then the entries in the table must obey the same 
relationships which were developed in the preceding sections. For 
example, the table must exhibit consistent state sums, and allow 
completion of the maximum memory process. The resulting restric- 
tions on the unspecified entries may be used to deduce their appro- 
priate assignments. 

Consider Table XI in which the “don’t care” next states are de- 
noted by the symbol ¢. 


TABLE XI 

0 1 
Sy t, So 
S2 S4 Se 
S83 Ss ts 
S4 So 84 
Ss S3 Sy 
So 8, 83 
87 te Ss 
Ss ts Ss 


The state sum is 
Ve tt =u ty = ty tvg =v, +03 = to tvg = tz tvs. (88) 
From the third and fifth terms, it follows that t, = ¢, . Forming 
A(u, + v3 + 04 + U6) 
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leads to 

Ve AH} tg tic = Ob; (39) 
Also, 

A(v, + v6 + v2 + t,) 
yields 


yYtutwytih = 0, 


where #, = At,. Locating v,, v, , and v, in the state sum leads to the 
conclusion ?, = v;. The present state which gives s; as its next state 
under zero input is s;. So that, ¢; = v;. The first and last terms of the 
state sum imply that t; = v,. From A(v, + vg + vs + t) obtain 


yYtuyrtept+h=vytwi+tw.+th = 0. 
Adding the last equation to equation 39 gives 
ietitwti=ov tote. tih = 0. 


However, equation 38 indicates t2 + vg = v2 + v; ;s0 that i, = é. From 
the table, 2 = v,. Table XII shows the fully specified table. 


TABLE XII 

0 1 
8) S5 So 
82 S4 S6 
S3 Sg 87 
S84 S86 S4 
Ss5 S83 8, 
Ss Sy S83 
87 S87 Ss 
Ss So S85 


APPENDIX 


An Addition to Theorem 2 


For N not equal to 2” there is a set of 2" — N vectors that cannot 
be used as state vectors. However, when the system is linear operat- 
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ing on any unused vector with the A matrix, and adding Bu, must 
produce a vector which is also an unused vector. If this were not 
the case, there would exist a state vector which would give rise to 
one of the unused vectors as a next state for some input. It follows 
that a transition table can be constructed containing only the 2” — N 
unused vectors. Let this system be called a virtual system. From the 
foregoing, it is clear that the states of the virtual system must obey 
the restriction of Theorem 2; that is, the number of distinct inputs 
cannot exceed 


2 ++ (2" == N) pers, 


t=2 
where ¢ is as before. Since the virtual and original systems must re- 
main disjoint, the original system must observe the bound. This is a 
smaller upper bound than obtained in Theorem 2. 
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Laser Machining of Thin Films 


and Integrated Circuits 


By M. I. COHEN, B. A. UNGER, and J. F. MILKOSKY 
(Manuscript received September 5, 1967) 


The feasibility of using the YAG laser as a tool to thermally machine 
integrated circuits has been studied. Results suggest that defining the resistor 
geometry, trimming the resistors to value, fabricating gap capacitors, 
and defining interconnecting circuitry might be performed by such a laser. 

Pattern generation by laser machining has been demonstrated on various 
thin and electroplated films. Vaporized lines (gaps) are readily attainable 
as fine as 0.25 mil in thin films and 0.4 mil in plated films. Much thinner 
lines may be obtained under particularly well-controlled conditions. These 
films may be removed with minimum effect to the substrate surface. The 
heat-affected region of the substrate may be confined to less than one film 
thickness. Better laser output control and shorter pulse widths will diminish 
this thickness. 

Gap capacitors have been made on sapphire substrates with capacitance 
approaching 20 pf in 0.04 square inches, and experiments suggest vm- 
provements. 

Tantalum films may be shaped to resistor geometries and trimmed to 
tolerance by removing metal or by oxidizing the resistive film with the 
laser. Resistors usually can be trimmed to tolerances of less than +0.1 
per cent. 

With further development it might be possible to combine these laser 
machining processes into a single-step, automated fabricating procedure 
for certain types of integrated circuits. We review some of the technical 
aspects of this and discuss using Q-spoiled YAG lasers to directly machine 
masks for photoetching. 


I. INTRODUCTION 


The steps to fabricate thin film passive elements and interconnecting 
circuitry on hard substrates are well documented. ? These procedures, 
defining resistor geometry and interconnecting circuitry, resistor trim- 
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ming to value, and capacitor fabrication, are the essential steps for 
producing precise integrated circuits. It is our intention to show that 
a laser can be used to vaporize, in a controlled manner, thin film 
structures, and that it therefore might be capable of supplementing or 
performing these four processes; or it might be used to supplement 
photolithography by directly machining thin film masks for making 
circuits by photochemical methods. 

We studied methods for performing these steps with a continuous 
neodymium doped yttrium-aluminum-garnet (YA“fG:Nd or YAG) 
laser. Laboratory work has demonstrated the feasibility of fabricating 
circuit building blocks and eliminating many steps for film structures 
which are common to tantalum integrated circuits; that is, circuits 
in which the resistive films are compounds of tantalum. Similar 
techniques will be applicable to other types of circuits. 

Tantalum thin film resistors have been shaped and trimmed to 
value with a laser beam. A controlled Jaser has removed various 
combinations of thin films from substrates without adverse effects 
to the substrates. Tantalum films have been thermally oxidized with 
the YAG laser as the heat source. Controlled parasitic or gap capaci- 
tors have been made with specific capacitance up to 4.5X10-* pf per 
square mil by laser machining narrow lines across thin films. 

The YAG laser has demonstrated considerable potential as a thin 
film machining tool. The simplicity of operating and controlling the 
device, and the characteristics of its output beam render it particu- 
larly well suited to this application. 


II. GENERAL CONSIDERATIONS 


2.1 Characteristics of Lasers as Fabricating Tools 


The utility of a laser as a tool for fabricating thin film circuits 
results primarily from the spectral purity and degree of collimation 
of the laser light. These characteristics allow the beam to be focused 
to a very fine and intense spot. The high heat flux which occurs when 
the light is absorbed by the target material, and the sharp definition 
and localized nature of the working region allow heating, melting, or 
vaporizing minute amounts of material, with minimum effect to ad- 
jacent material or components. 

Among other useful characteristics of light as a working tool is 
its small absorption depth in metallic materials. This property renders 
the laser particularly applicable to operating on thin materials such 
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as films without damaging the substrate or material beneath the film. 
In addition, energy may be transferred to the workpiece through an 
optically transparent material or atmosphere, and without physical 
contact with the workpiece. Encapsulated or otherwise inaccessible 
parts may be machined, and working regions may be kept free of 
contamination which might result from contact with a tool. Refer- 
ences 3 and 4 give more detailed discussion of the suitability of lasers 
for fabrication. 

Many types of lasers have been shown to be applicable in processes 
related to thin-film circuit fabrication. Pulsed lasers have been used 
to vaporize slots and lines in metallic targets® and have been included 
in an experimental automated procedure for trimming thin-film 
resistors. Pulse-pumped He-Ne gas lasers have been used to scribe 
lines on metals.7 Reported linewidths are 12.5 microns. Ionized argon 
lasers operating at one-half watt cw output have been used to scribe 
lines as fine as 10 microns in iron-nickel films deposited on glass.® 

The neodymium-doped yttrium-aluminum garnet (YAG) laser dis- 
cussed by Geusic and others,>™ is particularly well suited to thin- 
film fabrication because of its good combination of such character- 
istics as the adequate intensity, stability, and optical quality of the 
output beam, and its simple and compact design. Such YAG lasers 
may be operated continuously at about 1 watt output, or may be 
repetitively Q-switched by rotating the rear reflector. The Q-switched 
output is a continuous train of pulses with peak power exceeding 
1 kw, and pulse duration about 200 ns when the repetition rate is 
400 cps. Both types of operation may be obtained by using as a 
pump source an inexpensive lamp powered directly from line current. 
In both cases the laser may be adjusted to oscillate in a sufficiently 
low order mode that the output beam may be focused conveniently 
to the fine spot needed for precise thin-film machining. 


2.2 Machining Thin Films 


The processes that we studied, except thermal oxidation, use the 
laser’s ability to vaporize material. It is desirable, therefore, to dis- 
cuss briefly a few of the parameters and phenomena of material re- 
moval by laser. Cohen and Epperson give more detail.® 

We notice first that it is the optical power density in the focused 
spot rather than the laser power output, itself, that determines the 
suitability of a laser for removing material. Greater power densities 
often may be obtained from a laser that oscillates in a low order 
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mode, than from a higher-powered multimoded device. For the useful 
case of a laser oscillating in the fundamental Gaussian mode, the 
minimum focal spot radius, w;, may be determined from the relation- 
ship. (See Ref. 3.) 





Iii ~ Gta) io 
-4(.-@49)4 a, (1) 
where, f is the focal length of a lens with sufficiently great aperture 
to admit the entire beam, wo and zp are parameters which depend 
upon laser cavity configuration, d represents the distance of the focus- 
ing lens from the cavity, and @ is the far-field beam divergence angle. 
In many metal-working applications, including the one we are dis- 
cussing, the lens is sufficiently far removed from the laser output 
reflector that equation 1 becomes 


: Af 
ee 130 Or ew @) 
where w is the radius of the beam as it enters the lens. Equation 2 
predicts, therefore, that spot sizes of the order of wavelength, A, may 
be obtained with lenses having small f numbers (that is, small ratios 
of focal length to aperture). . 

The size of the affected zone in the target material will depend 
on the thermal properties of that material as well as the optical spot 
size and the intensity distribution across the spot. Edge definition 
of the affected zone depends primarily on thermal properties of the 
target and the duration of exposure. Metals with high thermal dif- 
fusivity and a large difference between melting and vaporizing tem- 
peratures, such as gold and copper, tend to develop a lip formed by 
molten metal around the region from which material was removed. 
The lip may be minimized by using a very short exposure such as that 
which might result from operating the laser in the Q-switched mode. 

The use of Q-switched output also is desirable to minimize thermal 
damage to the substrate. Damage results both from heat conducted 
from the film into the substrate, and from direct impingement of the 
focused laser light on the substrate after the film has been removed. 
Using laser pulses of high peak power and short duration substantially 
decreases both effects.* 

Many metals reflect an appreciable portion of the incident laser 
light. Such metals might need much higher laser output levels than 
nonreflecting materials. In most laser micrometalworking processes, 
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the surface remains solid and reflecting for only a small portion of 
the laser pulse duration. The reflectance of the surface may decrease 
abruptly when it melts or reacts with its atmosphere, and subsequent 
absorption will occur with greater efficiency. The initial laser output, 
however, must be sufficient to break down the surface. 


2.3 Lxeperimental Apparatus 

Figs. 1 and 2 show the apparatus used in our experiments. The YAG 
laser was Q-switched by rotating the rear reflector at 400 hertz. The 
output pulse parameters at the rated voltage of the pump lamp (120 
V) were about 1 kW peak power and 200ns duration at the half 
power point. Output, at 1.06 micron wavelength, was monitored by 
means of a photomultiplier tube behind the cavity, and the photo- 
multplier frequently was used in conjunction with a thermopile which 
measured the mean power of the output beam, so that the peak power 
could be calculated. The laser output was attenuated by neutral den- 
sity filters. Laser mode pattern was observed by means of an image 
converter tube, and the laser cavity was adjusted periodically so 
that most of the output was contained in the fundamental mode. 

An x-y-z micropositioner was used to focus and move the workpiece. 
For applications such as line scribing, in which the work is moved 
across the beam, one of the micrometer barrels is rotated by means 
of an hydraulic drive coupled to a gear and belt mechanism. A wide 
range of continuously-variable speed was available. The maximum 
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Fig. 1— Schematic diagram of laser machining apparatus, 
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permissible sweep speed, however, was dependent upon the diameter 
of the focal spot and the stability of the laser output. With excessive 
sweep speed, successive laser pulses did not overlap, causing scallop- 
ing at the edge of the line. Speeds of 0.1 inch per second were per- 
missible with 0.0002-inch gaps, and correspondingly lower speeds were 
required with smaller gaps. Working atmosphere generally was air, 
although limited tests suggested that somewhat better edge definition 
is possible in oxygen-rich environment. 

The viewing and focusing device consisted of two partial mirrors 
and an eyepiece mounted on a traverse mechanism, and a separately- 
mounted microscope objective. Objectives with 23 mm focal length 
(10X) were used for most of the applications studies associated with 
direct machining of tantalum integrated circuits, and lenses with 
focal length as short as 4 mm (approximately 70X) were studied for 
thin-film mask-making application. 

The 23 mm objective was a particularly good lens for general- 
purpose work. Its depth of focus was sufficiently great (approxi- 
mately -£0.001 inch) that fine lines of appreciable length (greater 
than 0.5 inch) usually could be scribed with good uniformity on 
nonuniform surfaces such as that of a glazed alumina substrate with- 
out elaborate alignment of the workpiece. The spot diameter, 2 wy, 
calculated from equation 1 and the parameters appearing in Fig. 1, 
was about 8 microns (about 0.0003 inch) for this lens. Increasing 
the strength of the objective to about 40X will decrease the cal- 
culated spot size to about 2 microns, but the severe loss of depth of 
focus restricts use of such lenses to very flat and well-aligned targets. 
Such lenses with short working distances also necessitate careful at- 
tention to laser output intensity in order to prevent lens damage due 
to the laser plume of vaporized material. 

The apparatus described has been used to vaporize spots and scribe 
lines in a wide variety of thin-film structures. Figs. 3 through 5 
show some of the general characteristics of such lines. 

Figure 3 indicates the effect of laser beam intensity and stage sweep 
speed on width and definition of lines vaporized in a gold thin film 
(approximately 3000 A) on sapphire substrate and a nichrome film 
(2000 A) on quartz. An 8 mm lens was used to scribe the gold film 
and a 4 mm lens was used for the nichrome. The finest lines were 
about 0.00025 inch wide for the gold and 0.0001 inch for the ni- 
chrome. Such gaps generally contain no metallic debris, as evidenced 
by the very low electrical conductance that is measured across them.? 
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# 





GOLD 
amm LENS 
125 x 
A B c D 
LINE NUMBER 123 4 12 3 4 123 4 1293 4 5 


MACHINING SPEED 
(INCHES PER SECOND) 


AVERAGE LASER 


0.100 0.062 0.031 0.010 





POWER (mW) 76 96 125 141 76 96 125 141 76 96 125 141 76 96 125 141 54 
NICHROME 
4mm LENS 
125x 
A B Cc D 
LINE NUMBER a ells ee 123456 
MACHINING SPEED 
(INCHES PER SECOND) mie 0.062 0.031 0.010 
POWER (mW) 4.2 6.212 2460 4.26212 24 60 38 4.2 6.2 12 2460 38 4.2 6.2 12 24 60 


Fig. 3— Effect of machining parameters on lines machined in 3000 A gold 
film on sapphire and 2000 A nichrome film on quartz. 


Lines as fine as 0.00004 inch (1.0 micron) have been scribed in tan- 
talum nitride, titanium, and nichrome films. 

It has been shown that for thin film samples similar to those shown 
in Fig. 3, the extent of damage to the sapphire substrate may be 
minimized by proper control of power density and sweep speed.? Fig. 
4, representing a typical cross section through such a sample, indicates 
that the extent of the affected zone may be limited to a depth ap- 
proximately equal to the film thickness. Recent results have shown 
that some metal films may be removed with no substrate damage 
observable by optical means. 

Fig. 4 also suggests the presence of a small gold lip bounding the 
laser-machined gap. The size and nonuniformity of the lip, and the 
depth of the thermally affected zone in the substrate increase as the 
film thickness is increased. Fig. 5 shows typical depth of penetration 
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into the sapphire when the gold thin film is plated with approximately 
0.3 mil of copper and then 0.01 of gold. Although gaps as narrow as 
0.00020 inch have been machined in such films with a 23 mm lens, 
gaps smaller than the film thickness require precise control of laser 
and stage parameters. The lateral extent of the heat-affected zone 
resulting from thermal conduction in YAG laser-machined thick 
films often is the same as the film thickness. 

Results similar to these have been obtained when the substrate 
material is glazed alumina, quartz, or silicon rather than sapphire, 
although the body of data for these materials is not nearly so large 
as it is for sapphire. Damage to the surface, apparently caused by 
melting, is confined to a depth less than the film thickness. Cracking 
of the glaze in the vicinity of the working zone has not been observed 
when laser parameters have been adjusted properly. 


III. DIRECT MACHINING OF TANTALUM INTEGRATED CIRCUITS 
3.1 Gap Capacitors 


3.1.1 Characteristics 


Interelectrode effects normally are present in miniaturized or high 
density integrated circuits. However, these parasitic or stray capaci- 
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Fig. 4— Cross section through gap machined in 3000 A gold film showing the 
affected zone in the sapphire substrate. 
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tances usually are minimized by judicious spacing or isolating the 
components. These parasitics, gap capacitors as we call them in this 
article, can be produced with values high enough for use as discrete 
circuit elements by varying the spacing and length of the coupling 
electrodes, and the dielectric constant of the surrounding media. 
Kaiser and Castro have calculated the interelectrode effects be- 
tween two thin film conductors deposited on a substrate.? The cal- 
culations were directed at predicting parasitic capacitances between 
parallel conductors on substrates of various dielectric constants. The 
analysis was based on a model of two parallel conductors a distance 
d apart, of equal width 1, on the same side of a substrate with dielec- 
tric constant k, of finite substrate thickness t, and of infinite extent. 
It has been assumed that the dielectric constant of the environment 
is negligible compared with that of the substrate. Fig. 6 presents, for 
various d/t and l/t ratios, calculations based upon this analysis. 
Fig. 6 shows experimental data for some gap capacitors fabricated 
in the laboratory by means of a Q-switched YAG laser. Gap width 
was varied from 0.0005 to 0.025 inch for thin chrome-gold films on 
0.025 inch thick barium titanate substrates (dielectric constant ap- 
proximately 500), and 0.0003 to 0.015 inch for tantalum-chrome-gold 
thin film composite on 0.03 inch sapphire. In the case of the high 
dielectric substrate, agreement with numerical calculations is best for 
the larger gaps. Behavior for the fine gaps probably is affected by the 
granular and nonuniform nature of the substrate, and further study 
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Fig. 5— Cross section through gap machined in plated thin film showing the 
iffected zone in the sapphire substrate, 
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Fig. 6 — Capacitance of gap capacitors as a function of gap width. Comparison 
of analytical and experimental results. 


is necessary. The shape of the curve for sapphire is similar to that 
predicted by the analytical model. The displacement between the 
curves may be explained by considering the fringing capacitance 
through the environment. 


3.1.2 Laser Fabrication 


A Q-switched YAG laser was used to machine fine lines in thin 
film conductors to form the gap capacitors. The effects of gap width, 
line length and geometry have been investigated. Most of the data 
were obtained with gap capacitors on single crystal polished sapphire 
substrates. Limited data also have been obtained for other substrate 
materials. 

Fig. 7 demonstrates the linear relationship that has been observed 
between capacitance and gap length. There is no observed effect of 
bending the cut into a serpentine configuration until, as will be 
shown later, the parallel legs become sufficiently closely spaced. These 
results suggest that there is no significant contribution due to the 
field concentration at the corners, since such an effect would cause 
a deviation from linearity. 

Most of the data in Fig. 7 were taken on tantalum-chrome-gold 
thin film composites on sapphire. Some data also are included for gaps 
machined in 0.3 mil thick copper-plated conductors. Gap width in 
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Fig. 7— Capacitance of gap capacitors as a function of gap length. 


both cases was 0.5 mil. The plated films appear to produce capacitance 
values similar to those for unplated thin films. 

The response of gap capacitors to frequency was determined by 
measuring a gap capacitor at frequencies to 4 GHz. A plated 0.5 mil 
wide gap capacitor cut in a plated 50 ohm transmission line on an 
alumina substrate (Fig. 8) was measured with the following results: 





Fig. 8— Gap capacitor laser machined in plated 50 ohm transmission line on 
glazed alumina substrate. 
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At frequencies of 465 kHz, 1 MHz, 3 GHz, and 4 GHz, the meas- 
ured capacitance was 2.49, 2.48, 2.65, and 2.49 pf, respectively. The 
response of a laser machined gap capacitor appears relatively flat 
up to 4 GHz. 

A further study was made of the effect of gap width on capacitance 
value. Fig. 9 shows the results of a tantalum-chrome-gold composite 
film on a polished sapphire substrate, and of a chrome and gold 
composite on an unglazed barium titanate substrate. Calculated points 
from Fig. 6 are indicated to show the agreement with the analytical 
model. The data indicates that the gap capacitors are insensitive to 
changes in gap dimensions when the gap is a few mils wide, but very 
sensitive when the gap is made less than a mil across. These results 
suggest that in order to achieve the highest possible specific capaci- 
tance it is desirable to achieve the smallest possible gap width. On 
a normalized plot, such as Fig. 9, the elements are also insensitive 
to substrate dielectric constant and the dielectric properties of the 
test environment. Gap capacitance therefore can be presented on a 
normalized basis with some constant as a dielectric scaling factor. 

We show in Fig. 7 that capacitance varies linearly with length for 
straight-line gaps. To achieve a high capacitance per given area, 
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Fig. 9—- Capacitance of gap capacitors as a function of gap width. 
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however, the gap may be fabricated in a serpentine geometry. Fig. 
10 shows a portion of a serpentine capacitor with 0.5 mil gap and 
5 mil spacing. Fig. 11 shows the results of capacitance as a function 
of total line length for serpentine gap capacitors of equal heights 
on a conductor 0.225 inch wide. The curve is linear down to about 
a 50 mil spacing between the parallel legs. As this dimension de- 
creases, the relationship between capacitance and line length becomes 
nonlinear. These results suggest the interaction of fields remote from 
the gap edges that significantly affects the value of the element. 

Specific capacitance is one means of comparing different thin film 
capacitors. A specific capacitance can be given for gap capacitors 
provided the substrate dielectric constant and gap width are stated. 
Laser machined gaps 0.5 mil wide have been made in films on 200 
mils-square sapphire substrates (IK ~ 10) in a serpentine fashion 
with values up to 18 pf. This corresponds to a specific capacitance of 
45x10 pf per square mil. It is estimated that this value can be 
increased to about 1x10 pf per square mil by adjusting the ser- 
pentine geometry and decreasing the gap width to 0.2 mil. Such gap 
widths have been machined in thin films with the YAG laser when 
the optical alignment and beam control have been precise. With care- 
ful control of techniques and equipment, repeatable gap widths of a 
few microns or less are practical. 





Fig. 10 —Serpentine gap capacitor on sapphire substrate. Gap width 0.5 mil, 
separation between legs 5.0 mils. 
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Fig. 11—Capacitance of serpentine gap capacitors as a function of total 
length of gap. 


Leakage currents less than 10° amperes at 40 V are normal for 
these units. These values indicate that there is very little debris 
remaining in the gap. 


3.2 Trimming Thin Film Resistors 


Two techniques for increasing the resistance of tantalum nitride 
resistors with the YAG laser have been explored. Initial studies have 
established the feasibility of trimming to value by removing material 
and by thermal oxidation. 

Trimming by removing material may be accomplished either by 
changing the dimensions of the resistor or by vaporizing small spots 
in the interior of the resistor. Fig. 12 shows a resistor whose value 
has been increased 19 per cent by means of a series of internal spots, 
each vaporized with a 1/50-second exposure to the Q-switched beam. 
Included in Fig. 12 are the results of resistance measurements taken 
on this resistor. 

Changes in resistance from less than 0.01 to over 0.1 per cent per 
spot have been demonstrated by varying the size and location of the 
spot on the resistor. Fig. 138 demonstrates typical resistance changes 
for a resistor, one of whose edges has been removed progressively in 
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Fig. 12 — Resistor trimming by spot material removal. 


0.0001-inch steps. With proper instrumentation and control, either 
the spot or edge laser vaporization method could provide a rapid 
means of trimming resistors to better than 0.1 per cent. We notice 
that resistors trimmed by material removal change less than 0.1 per 
cent in resistance after such resistors are exposed to the laboratory 
environment for several months. These results suggest the temporal 
stability of laser trimmed resistors. A definitive measure of stability, 
however, will necessitate a power aging test under controlled condi- 
tions. 

Our studies of thermal oxidation were exploratory because a proper 
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quantitative investigation will require a laser with sufficiently high 
output to achieve the required power density over a spot large enough 
to completely expose the entire resistor. In the present study, a spot 
about 0.001 inch across was used and the resistor surface (0.005 inch 
square) was swept past the beam. A uniform brown color resulted, 
accompanied by a change in resistance from 16.96 to 17.98 ohms. The 
same color and increase in resistance may be obtained by wet anodiza- 
tion to a potential of about 20 V. 

A much greater increase in resistance (for example, 16.29 to 38.26 
ohms) was obtained by further exposure. The surface, however, ap- 
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Fig. 13 — Resistor trimming by edge material removal. 
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pears nonuniform in color, indicating the varying degrees of oxida- 
tion. As treatment progressed, the process appeared to lose stability 
in the sense that it required greater attention to laser output and 
stage sweep speed to prevent the resistor from burning through locally. 
The indication is, however, that both uniformity and stability will 
improve when larger laser spots are used. 

We noticed that oxidation of tantalum areas adjacent to gold con- 
ductors is possible without damage to the gold. The high reflectivity 
of the gold raises the damage threshold of gold well above the power 
required for converting the tantalum to oxide. 


3.3 Miscellaneous Applications 


The YAG laser may be used to remove metal films in order to de- 
fine interconnecting circuitry. Such pattern generation requires no 
considerations other than those already discussed. With the present 
laser the small spot size necessitates many passes by means of an 
index-and-repeat method, in order to remove large areas of film. Such 
a method would be more economical if the present YAG laser were 
Q-switched at great speeds so as to allow correspondingly greater 
sweep speeds. An alternative would be to use a higher powered laser 
to define roughly the circuit dimensions, and then to trim precisely 
with the present laser. 

The initial results suggest the possibility of fabricating, in one step, 
a complete integrated circuit with all the passive elements. Such a 
process would start with a metallized substrate and would use a 
programmable laser and work stage. Complete laser fabrication of 
hybrid circuits will require a process in which a metal film is removed 
selectively, exposing a different film. For example such a process may 
be necessary in order to remove the conductor and expose the resistor 
film. 

In the present tantalum-chrome-gold technology such a selective 
removal of the gold presents substantial difficulties because the reflec- 
tivity of the gold is much greater than that of the tantalum nitride. 
It is quite probable, however, that some combination of films will be 
found for which the upper film can be removed from the resistor with- 
out damaging it. 


IV. MACHINING THIN-FILM MASKS 


The machining of masks involves considerations similar to those 
for pattern generation. Experience has indicated that a variety of 
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sputtered and deposited films (1000—3000 A) on quartz, glass, and 
sapphire may be machined conveniently. Patterns machined in tita- 
nium and nichrome films, however, appear to have superior edge 
definition and uniformity of transmitted illumination compared with 
patterns produced in other films. Mechanisms governing the behavior 
of the metal film during laser machining have not yet been defined, 
but appear to depend upon the surface tension and bonding charac- 
teristics of the film as well as the thermal properties of both film and 
substrate. 


V. SUMMARY 


The feasibility of machining resistive and capacitive components 
directly on thin film metallized substrates with a laser has been demon- 
strated. Tantalum films can be shaped into resistor geometries and 
trimmed to tolerance by removing metal. These films also can be 
oxidized to value using the laser beam as the heat source. Resistors can 
be made with tolerances in value of less than +-0.1 per cent. 

Gap capacitors have been made on sapphire substrates with capaci- 
tance measuring up to 20 pf in an area of 0.04 square inch with 0.5 
mil gap spacing. Limited studies have indicated that these elements 
are stable with time, have leakage currents at 40 V of less than 10~° 
amperes and do not change significantly with frequency up to 4 
GHz. Variation of capacitance with gap width has been studied and 
experimental results show good agreement with numerical results 
based on an analytical model for determining parasitic capacitance. 

Pattern generation by laser machining has been demonstrated on 
various thin films as well as on electroplated films. Vaporized lines 
as fine as 0.25 mil are readily attainable in thin films, as are 0.4 mil 
lines in plated films. Much narrower lines may be obtained under 
particularly well-controlled conditions. Uniform lines as fine as 1 
micron have been scribed in thin films on sufficiently flat substrates. 
These films have been removed with minimum effect to the substrate 
surface. 

The present work has been accomplished with a Q-switched YAG 
laser operating with a repetitive output, at 400 hertz, of 1 kw peak 
power and 200 ns pulse duration. Advantages of such a laser as a 
machining tool include the optical quality of the output, and simplicity 
and economy in both design and operation. Maximum machining speed 
presently approaches 0.1 inch per second for 0.2 mil gaps, and a further 
increase of one to two orders of magnitude may be expected as the 
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repetition rate of the laser is increased. Continued development of the 
YAG laser, particularly with regard to decreasing the width of the 
pulse, is expected to provide for increased definition of the working 
zone, improved selectivity in removing films of various metals, and a 
further decrease in the already small damage to the substrate. 

Our study has been exploratory and has served only to establish 
the feasibility of machining thin film circuits with existing lasers. 
Further attention to the details of the specific machining processes as 
well as to the combination of these processes into an automated pro- 
cedure is necessary in order to evaluate their practicality. It is neces- 
sary, also, to better define the mechanisms governing laser material 
removal processes in order to realize fully the potential applicability 
of laser machining of thin film circuits. 
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Performance Degradation 
by Postdetector Nonlinearities 


By GEORGE H. ROBERTSON 
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This article gives the performance degradation found by using a computer 
program to calculate the effects of various processing techniques applied 
after envelope detection of narrowband Gaussian noise plus a CW signal. 
Nonlinear processes that we studied are: suppression of low output levels, 
hard limiting of high output levels, restricted dynamic range of output 
level, and quantizing the detector output. 

By modifying the program, we produced performance curves for systems 
in which a steady CW component was present in addition to possible 
signals. We also adapted the program to generate performance curves for 
systems using a square-law detector so that a comparison could be made 
with the results obtained for an envelope detector. We found that the presence 
of a CW component, and all the nonlinear processes applied after envelope 
detection, caused a loss of sensitivity in detecting small CW signals. Quan- 
tizing even in as few as 8 levels, however, caused very little additional 
degradation. 


I. INTRODUCTION 


A computer program was developed that would draw receiver op- 
erating characteristic curves for a system using an envelope detector 
‘to search for CW signals in narrowband Gaussian noise.t The pro- 
gram computed the probability that the detector output would exceed | 
a chosen threshold under two circumstances: 


(z) No signals were present; this result gives the probability of 
false alarm, Pra. 

(it) Signals at various S/N were present; this result gives the prob- 
ability of detection, Pp. 


S/N is the ratio of the CW signal power to the noise power ac- 


* The U. 8. Navy supported this work under contract N600(63133)64940, 
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companying it in the specified narrow band. These probabilities were 
calculated using formulas which described the appropriate distribu- 
tions of the detector output. 

The program also averaged independent samples of the detector 
output, and produced appropriate receiver operating characteristic 
curves by deriving a Gram-Charlier A series for the distribution of 
the sample average. 

When the program was adapted to take into account the modifica- 
tions of the detector output caused by various subsequent nonlinear 
processes, receiver operating characteristic curves were produced for 
systems in which such processes occurred. The performance degrada- 
tion was determined as the change in S/N that would be required to 
make the detectability of a signal, for given probability of false alarm, 
the same as without the nonlinear process. 


II. DISCUSSION 


Since some types of nonlinear processing greatly reduce the labor 
of handling large amounts of data, and others are unavoidable, or 
their effects are costly to minimize, it is useful to know the penalties 
on performance that are incurred by their presence. 

The types of nonlinear process studied are: suppressing low output 
levels, hard limiting of high output levels, restricting dynamic range 
of the output level, and quantizing the output of the detector. 

Suppressing low output levels occurs in some kinds of recording 
equipment where the output must reach a minimum level before a 
record can be made. High output limiting always occurs because the 
power handling capacity of physical equipment is limited. Dynamic 
range is restricted when the first two processes occur together; this 
also represents a commonly used technique in which a fluctuating out- 
put is converted to.a binary waveform with respect to a chosen 
threshold. Quantizing is particularly useful when a digital computer. 
is used to implement the statistical analysis. With this application in 
mind quantizing was evaluated in combination with restricted dynamic 
range 

For simplicity in programming, we assumed that the unit used to 
process the output of the envelope detector had an appropriate trans- 
fer characteristic from the set shown in Fig. 1. It can be seen that over 
part of its operating range this unit has a linear characteristic. Quan- 
tizing would cause the slope of the linear region in Fig. 1(c) to be 
replaced by a staircase. 

In addition to the changes needed to allow investigation of the 
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Fig. 1— Transfer characteristics of post detector processor. (a) Suppression of 
low levels, (b) hard limiting of high levels, (c) restricted dynamic range. 


above processing techniques, we made alterations in the program so 
that it would produce receiver operating characteristic curves for sys- 
tems using a square-law detector, and for systems in which an un- 
wanted CW component occurred in the specified narrow band in ad- 
dition to the one representing a signal. 

The latter case is typical of situations where a large, relatively 
stable, CW component tends to mask the presence of a smaller stable 
signal in the resolved band of a spectrum analyser. The results show 
how effective in such cases is a decision process similar to that de- 
scribed in Ref. 1, for detecting a small signal. 

The distortions produced by some of these processes on the detector 
output distributions are so radical that many samples must be aver- 
aged to get an adequate Gram-Charlier representation even using up 
to 31 moments, the limit available in the program. Consequently a 
large number (8192) of samples were averaged before deriving the 
Gram-Charlier series from which the receiver operating characteristic 
curves could be produced. We made some checks where it was pos- 
sible to produce curves for fewer samples averaged. We marked these 
on the figures; they show only slight deviations from the results ob- 
tained by averaging 8192 samples. 

Since the normal distribution becomes a satisfactory approxima- 
tion as the number of samples averaged grows large it may be in- 
ferred that the degradation caused by nonlinear processes could be 
estimated by averaging enough samples and using the normal ap- 
proximation. For some of the processes described here the receiver 
operating characteristic curves for averaging 8192 samples showed 
perceptible variation from the results obtained by a normal approxi- 
mation at Py, values'less than 10-*. Thus, to ensure the validity of 
a normal approximation it might be necessary to average a very large 
number of samples indeed. 

When the results for one of the simulated nonlinear processes were 
plotted on the appropriate chart of Ref. 1, the variation of the degra- 
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dation from the average was never found greater than 10 per cent and 
usually within a few per cent. Consequently the average degradation 
shown in the accompanying figures represents a good approximation 
within the range covered by the appropriate chart of Ref. 1. 

Fig. 2 shows the degradation in performance when low output levels 
are suppressed, and when high levels are hard-limited. The position 
at which the levels are truncated is given along the abscissa as a 
multiple of the mean value of the ideal detector output when only 
noise is present (Rayleigh mean). The ordinate gives the change in 
S/N required with the distorting process to give the same performance 
as can be achieved without it. 

Fig. 3 shows the degradation sustained when the dynamic range 
factor of the detector output is 10 (solid line) and 2 (broken line). 
The dynamic range factor is the ratio of the high end of the linear 
range to the low end for a transfer characteristic like that in Fig. 1(c). 
Fig. 3 also gives the degradation sustained when the output is vir- 
tually converted to a binary signal by limiting the dynamic range 
factor to 1.001 (dotted line). The lower truncation point is shown 
on the abscissa and the ordinate gives the degradation in dB. 


III. QUANTIZING 


When a digital computer is used to implement the statistical anal- 
ysis of the detector output, it is important to know the maximum 
number of bits that need to be used to encode the output level. Three 
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_ Fig. 2— Degradation caused by (a) suppressing low output levels and (b) hard 
limiting high output levels. 
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Fig. 3 — Degradation resulting from limited dynamic range. 


bits allows coding into 8 levels, and four bits allows coding into 16 
levels, and so on. If three bits are used, for example, the range of 
possible detector outputs must be divided into 8 regions. The highest 
region will include all levels that would equal or exceed the high end 
of a characteristic such as that in Fig. 1c, and the lowest region will 
include all levels below the low end. The remaining six regions will 
be between the two truncation points. The computer simulation was 
carried out by dividing the detector output in this way and assigning 
the value of the lower limit of any region to all the values in it less 
than the upper limit. 

From Fig. 3 it can be seen that if a dynamic range factor of 10 is 
used and the low end is placed at three tenths of the Rayleigh mean, 
very little degradation in performance is suffered. When this range 
was quantized into 6 equal steps it was found that the total degrada- 
tion was only about 0.01 dB more than the value shown in Fig. 3. 

Two other ways of dividing the range into 6 unequal steps were 
tried, but the results were poorer than that for equal steps. In the first 
of these the range between the low end and high end was divided into 
6 intervals in such a way that the change in probability density was 
the same between each. The degradation in performance was about 
0.1 dB greater than that using 6 equal steps. The other way of quan- 
tizing the range divided it into 6 intervals of constant change in 
cumulative probability. The degradation in this case was 0.39 dB 
greater than that using 6 equal steps. 


IV. CW INTRUDER 

Fig. 4 shows that the system performance falls off rapidly owing 
to the presence of a CW component as the level of such a component 
increases from —10 dB with respect to the noise in the narrow band. 
The distribution curves corresponding to the no-signal hypothesis in 
this case were those for Gaussian noise plus a CW component at the 
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—*Fig. 4 = Degradation resulting from added CW signal. 


S/N specified on the abscissa. The curves corresponding to the hypoth- 
esis that a signal was present were those for Gaussian noise plus 
a resultant sine wave component. The power of the resultant equalled 
the power of the small signal plus the power of a large CW compo- 
nent at the level specified on the abscissa. The results thus represent 
the average for all possible phase relationships between the CW 
component and small signal assuming that all phase values are equally 
likely, and many samples of the detector output are averaged. 


V. SQUARE-LAW DETECTOR 


It was found that satisfactory Gram-Charlier series approxima- 
tions to the output distribution of a square-law detector could be 
produced only when a few hundred or more samples were averaged. 
Consequently subprograms were written to compute values for Chi- 
square and noncentral Chi-square distributions over the desired ranges 
of S/N and samples averaged. Using these and the results reported in 
Ref. 1 it was possible to produce the curves shown in Figs. 5, 6, and 7. 
which enable the receiver operating characteristic curves of Ref. 1 
to be used to estimate the performance for a square-law detector 
with quite good accuracy. Fig. 5: compares the performance of linear 
and square-law detectors at’a false-alarm probability of 10-* and 
three different detection probabilities when the number of samples 
averaged ranges from 1 to 8192. The approximate S/N corresponding 
to a. Pp, Pra pair is also given over the range of samples averaged. 
Curve 1 in Figs. 6 and 7 compares the detector performance for the 
same S/N as curve 1 in Fig. 5, but when the Pra is 5 X 10-*, and 0.09, 
respectively, and curve 2 in Figs. 6 and 7 do the same for the S/N 
that applies to curve 2 in Fig. 5. | | 
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VI. CONCLUSION 


We have given curves which show that the sensitivity for detecting 
small signals using an envelope detector is degraded when any of 
several common kinds of nonlinear processes occur between the detec- 
tor output and the averager used prior to the decision threshold. 
When only one sample of the detector output is used to form a deci- 
sion, the system will be oblivious to the presence of the nonlinear 
process as long as the decision threshold lies within the linear range. 
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Fig. 6 — Comparison of linear and square-law detectors, Pra = 5.10-4. 


414 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1968 


DECIBELS 


SQUARE-LAW 
BETTER 


LINEAR BETTER | 





O 2 4 6 8 10 12 14 
LOG> (M) 


Fig. 7— Comparison of linear and square-law detectors, Pra = 0.09. 


The system performance in such a case is thus restricted to threshold 
variations within this range. 

It is interesting that J. V. Harrington has analyzed the detection 
of repeated signals in noise using binary integration.2 The system he 
assumed corresponds closely to the one discussed here for which Fig. 
3 shows the results. Harrington deduced that the optimum position 
for the quantizing threshold was about 1.44 times the Rayleigh mean, 
in which case binary integration would give results 0.77 dB poorer 
than if ideal processing with a linear detector had been used. These 
numbers are in excellent agreement with the minimum degradation 
condition shown in Fig. 3. 

We have shown that the presence of an unwanted CW signal, even 
—10 dB with respect to the narrow-band noise, causes some degrada- 
tion of the sensitivity for detecting small CW signals, and the degrada- 
tion increases rapidly as the level of the unwanted component rises 
from there. 

Computation shows that there is little performance difference to 
be expected from the use of envelope or square-law detectors. Envelope 
detectors would thus seem preferable in view of their greater simpli- 
city, especially when a considerable dynamic range is to be covered. 
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Phase Principle for Measuring 
Location or Spectral Shape of 
a Discrete Radio Source 


By A. J. RAINAL 
(Manuscript received May 19, 1967) 


This paper describes a phase principle for measuring the location or 
the spectral shape of a discrete radio source. The phase principle ts relatively 
simple to implement and leads to a measurement of location or spectral 
shape which is insensitive to receiver gain fluctuations. For measuring 
the location of a weak, discrete radio source, the theoretical accuracy ts 
slightly better than the theoretical accuracy resulting from the Ryle inter- 
ferometer. For measuring the spectral shape of a weak, discrete radio 
source, the theoretical accuracy is slightly better than the theoretical ac- 
curacy resulting from either the Ryle interferometer or the Dicke radiometer. 
Furthermore, the implementation of the phase principle doesn’t require 
input switching. Also, the calibration curve associated with the phase 
principle is independent of changes in the average receiver gains of the 
two receivers. 


I. INTRODUCTION 


In many branches of science and technology observations of a dis- 
crete radio source provide fundamental knowledge. In the field of 
radar the illuminated target serves as the discrete radio source. In the 
field of space exploration the radio transmitter on-board the space 
vehicle serves as the discrete radio source. In the field of radio astro- 
nomy the “radio star” serves as the discrete radio source. 

The “radio star” is a remarkable example of a discrete radio source. 
In the past twenty years radio astronomers have discovered that na- 
ture provided many discrete radio sources or radio stars at certain 
locations in the sky. What are the locations of these radio stars? What 
is the power spectrum of the observed radiation from a particular 
radio star? Answers to such questions are of fundamental importance 
in the field of radio astronomy. 
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In order to measure relatively small values of radiated power from 
a discrete radio source, one must compete with the inevitable back- 
ground noise and the inevitable radio receiver noise. It is well known 
that one requires a method of measurement which is relatively in- 
sensitive to receiver gain fluctuations. The papers by Dicke® and 
Ryle® discuss this important point in more detail. In fact, the present 
day method for measuring relatively small values of radiated power 
from a discrete radio source makes use of some form of the Dicke? 
radiometer or the Ryle? interferometer. 

The purpose of this paper is to describe a phase principle for meas- 
uring the location or the spectral shape of a discrete radio source. We 
shall see that the phase principle is relatively simple to implement 
and leads to a measurement of location or spectral shape which is 
insensitive to receiver gain fluctuations. For measuring the location 
or spectral shape of a weak, discrete radio source, we shall see that the 
theoretical accuracy associated with the phase principle is slightly 
better than the theoretical accuracy associated with the Dicke radi- 
ometer or the Ryle interferometer. We shall also see that for measur- 
ing the location or spectral shape of a weak, discrete radio source 
using only phase information, the accuracy associated with the phase 
principle is essentially equal to the accuracy associated with the maxi- 
mum likelihood principle. 


II. MEASUREMENTS BASED ON THE PHASE PRINCIPLE 


2.1 Implementation 


Fig. 1 illustrates a simplified implementation of the phase principle 
for measuring the location or spectral shape of a discrete radio source. 
S(é), N,(6), and N.(é) represent zero mean, independent, narrow-band, 
stationary Gaussian processes. N,(t) and N.(t) each represent the sum 
of background noise plus receiver noise. N,(t) and N.(t) are assumed 
to have equal variances. The spacing, d, between the two antennas is 
many wavelengths in order that N,(t) and N.(t) can be considered as 
independent processes. S(t — At) and S(é + At) are due to the presence 
of a discrete radio source located at a small angle 6 with respect to 
boresight. We assume that the receivers preserve the phase difference 
between the antenna excitations. 

S(t — At) + N,(@) and S(é + At) + N,(t) represent the outputs 
of the two receivers. 7; represents the ith independent sample of the 
phase difference between S(é — At) + N,(t) and S(é + At) + Nei). 
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Fig. 1— Simplified implementation of a phase principle for measuring location 
or spectral shape of a discrete radio source. When source is located at d sin 6 = 
\/4, nt =: 0. When source is located at @ = 0, receivers scan in frequency and yt 
traces out a measure of spectral shape. 


n: 18 taken to be in the primary interval (—7z, 7). After n such samples 
the output 7' is given by 


pi Se akan: (1) 
nN ¢=1 


where 


n = Br 
B = IF bandwidth 
= observation time. 


4y 


We shall assume that n is relatively large like n = 10°, since we are 
primarily interested in observing relatively weak, discrete radio sources. 

Fig. 2 illustrates a relatively simple method for generating 7' from 
the inputs S(t — At) + N,(é) and S(é + At) + N.(t). Ri, wo, and 4, 


represent the envelope, IF angular frequency, and phase angle, respec- 





V2 COS|wot +6 
S(t-At)+Ni(t)=R,cos[wot+ 6,| BAND- | |” [wct+4 
PASS 


LIMITER 





cos (2-6:) 
/ 









Low- nt 
PASS AVERAGE 
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S(t+At)+Na(t)=Recos[wot +62] Nave cos[wot +82] 


Fig. 2—A method for generating i from the inputs S(t—At)4+N,(¢) and 
S(t+At)-+N2(t). The band-pass limiters remove all amplitude information. 


418 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1968 


tively, of the narrow-band Gaussian process S(t — At) + N,(é). Sim- 
ilarly, R2, wo, and 6, represent the envelope, IF angular frequency, 
and phase angle respectively of the narrow-band Gaussian process 
S(é + At) + N.(t). The band-pass limiters shown in Fig. 2 are well- 
known devices for removing all amplitude information and preserving 
the phase information as is indicated in Fig. 2. See Davenport and 
Root* for a discussion of the band-pass limiter. 

As indicated in Fig. 2, if one takes the product of ~/2 cos (wot + :) 
and ~/2 cos (wot -+ 62) and passes the result through a suitable low- 
pass filter the result is cos (86. — 61) = cos 7(t). By taking the average 
of this result, one can generate 7}. 

This method of generating 7‘ can also be used to help implement 
the phase principle described in Ref. 5 in order to detect the presence 
of a discrete radio source located at 6 = 0. 

Fig. 2 indicates clearly that nt is independent of receiver gain fluctua- 
tions or changes in the average receiver gain of each receiver. Also, 
the two receivers need not have the same average gain. Thus, an 
unusually long observation time 7 is advantageous when using the 
phase principle. 

Notice that if the band-pass limiters in Fig. 2 are shorted out, we 
have the well-known correlator configuration.°"”"*"° 

We shall now show that a measurement of 7* leads to a measurement 
of location or spectral shape of the discrete radio source. 


2.2 Statistical Properties of 7; 


In order to simplify the analysis, we shall always assume that the 
discrete radio source is at a small angle 6 with respect to boresight. 
To begin, we shall state some known statistical properties of the angle 7; . 

Equation (34) of Ref. 10 gives the probability density p.(n) of each 
independent sample 7; as 








1~—T 2-3 s =I 2 7. a2 
p(n) = OF ad coo Bo) ls. sm Bo + tee + V1 = a | ’ (2) 
where 
B. = Ll cos (n — m9) 
a 
ce l+a 


_ Var SQ) _ Var S(Z) 
© Var N,(t) Var N2(8) 
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Var = Variance 


no/2 = + dsin 8 = a At 
et oe vis 
SS sn he SS 


The Fourier series development of (2) follows from Middleton’s”’ 
equation (9.33) 


1 S7TT 2 1 
Po(n) = 1 iS POR +) p(t, a eae t) cosn(y — 
W n=1 
| (3) 
where 2F is the Gaussian hypergeometric function 
aes, a oB eat 1e6+)o , 
a (a, By y32) = 1+ Te + vy + 1) att 
and I is the gamma function. 
The expectations HZ cos y; and E cos’ 7; follow from (38) 
E cos 9; = 2 PAG, 33 2; 1°) cos ng (4) 
Le Ae 
E cos’ 9; = at a oF,(1, 1; 3; 1’) cos 2, . (5) 


We shall see that the phase principle for measuring location or spectral 
shape of a discrete radio source is based on (4). Equation (4) should 
be compared with (4) of Ryle,’ the equation which characterizes the 
output of a Ryle interferometer. Both equations are proportional 
tO COS 7». 


2.3 Measurement of Location 


Let us first consider the problem of measuring the location of a 
discrete radio source whose true location is some small positive angle 0. 
From (4) we see that # cos n; = 0 when 7) = 7/2 or d sin 0 = X/4. 
This suggests that we observe 7! and conclude that d sin 6 = /4 
when 7t = 0. How accurately can we form an estimate 6 of @ in this 
manner? 

For yo near 7/2 let the estimate #4, of 7» be determined from the 
linear equation 


l ; 
n = Z iG, a 2; n(2 — i): (6) 
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Thus, 


2 —2 
Var 4, = eny(2) Var 6 = | var | @).na. 2:2; | (7) 


or, in a more suitable form, 


2 
(22) var 6 =A + a) cos wPAG HD, @ 
where EF cos’ n; is given by (5) with 7) = 2/2. Equation (8) char- 
acterizes the theoretical accuracy associated with the phase principle 
for measuring the location of a discrete radio source and is plotted in 
Fig. 3. 


2.4 Measurement of Spectral Shape 


Now let us consider the problem of measuring the spectral shape 
of a discrete radio source located at 6 = 0. We shall assume that the 
variances of the background noises and receiver noises are invariant 
over the frequency region of interest. Under these conditions the esti- 
mate 4 of the signal-to-noise power ratio ‘‘a’”’ can serve as an estimate 
of spectral shape by using the well-known frequency scan technique 
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Fig. 3— Theoretical accuracies of Ryle interferometer, phase principle, and 
maximum likelihood principle (using only phase information) for measuring 
the angular location of a discrete radio source. 
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indicated in Fig. 1. How accurately can we form an estimate 4 of ‘a’ 
in this manner? 

Equation (4) with @ = 0 or 7, = O defines ‘‘a” as an implicit func- 
tion of H cos yn; which we shall indicate by 


= H(E cos n;). (9) 


Equation (9) suggests that we form an estimate @ of “a” from the 
equation 


= H(n). (10) 


A plot of (10) is presented in Fig. 4. This figure can be considered as a 
theoretical calibration curve. Notice that the theoretical calibration 
curve is independent of changes in the average receiver gain of each 
receiver. This is indeed unusual. One measures y! and reports the cor- 
responding value of d. Assuming that Var NV, and Var N, are invariant 
with frequency over the frequency range of interest, @ will then trace 
out the spectral shape of the discrete radio source as the receivers scan 
in frequency. We shall now characterize the accuracy of the estimate d. 

For large n, the only case of interest in this paper, Cramér’s” work 
shows that the estimate 4 is characterized, approximately, by a Gaus- 


2.0 


0.4 


0 0.08 0.16 0.24 0.32 0.40 0.48 0.56 
nt 


Fig. 4— The theoretical calibration curve associated with the phase principle 
for measuring (Spectral shape of a discrete radio source. The receivers scan in 
frequency and 4 traces out the spectral shape. 
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sian probability density having the following expectation and variance: 











Eé = H(E cos n;) + OM") = a (11) 
Var 6 = H? Var y' + O(n!) = H? Var 1’, (12) 
where , 
A f . -1 
H, = did _ | COS 7; | 
dn E cos 7% da 0 
4 1 2 r’ —-1 
-40 TE nae) +n hah| - 


This last result follows by differentiating (4) with respect to ‘a’, 
setting » = 0, and then taking its reciprocal. Equation (11) implies 
that the estimate d is essentially unbiased for large n. 

From (12) and (1) we have 


n Var 6 = Hi(E cos’ 4; — E” cos n,), (13) 


where E cos n; and E cos’ n; are given by (4) and (5) with 7) = 0. 
Equation (13) characterizes the theoretical accuracy associated with 
6b? 


the phase principle for measuring the signal-to-noise power ratio ‘‘a 
or the spectral shape of the discrete radio source and is plotted in Fig. 5. 


III. MEASUREMENTS BASED ON THE RYLE INTERFEROMETER OR DICKE 
RADIOMETER 


3.1 Measurement of Location 


When 6 is small and yn, = 7/2 or d sin 6 = )/4, the theoretical ac- 
curacy associated with the Ryle interferometer for measuring the 
location of the discrete radio source was derived by Manasse.’* In 
our notation Manasse’s’® (60) becomes 


(aa) Var 6 = (2n)"(1 + a)’. (14) 


Equation (14) characterizes the theoretical accuracy associated with 
the Ryle interferometer for measuring the location of the discrete radio 
source and is plotted in Fig. 3. 


3.2 Measurement of Spectral Shape 


One can measure the spectral shape of the discrete radio source 
located at 6 = O by using the Ryle® interferometer or the Dicke’ radi- 
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Fig. 5 — Theoretical accuracies of Ryle interferometer, Dicke radiometer, phase 
principle, and maximum likelihood principle (using only phase information) for 
measuring the spectral shape of a discrete radio source. Both the Ryle inter- 
ferometer and the Dicke radiometer require some amplitude information. 


ometer. In fact, these methods are at present the accepted methods. 
We shall go on to characterize the accuracy associated with these 
methods of measuring spectral shape. 

In order to simplify the notation of this section, let P, = Var S(t) 
and Py = Var N,(t) = Var N.(t). Then, 
and @ = Be 


& 


Py 








a= (15) 
P, denotes the unbiased estimate of P, and Py is regarded as a param- 
eter. 

The Ryle’ interferometer utilizes a phase reversing switch to produce, 
periodically, the following inputs to a square-law detector 


25(i) + Ni) + Nold) 
a aaa oe 
or 
Ni) + Ni), 
; (17) 
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Thus, using some of Rice’s’* results, the mean values at the output 


of the square-law detectors are, periodically, 


2P, + Pw 


5 (18) 
or 
Py 
ig (19) 


The difference in the outputs of the square-law detector is taken as 
the unbiased estimate P,. Thus, EP, = P, , and Fé = a. 

Also, using some of Rice’s’* results, the variances at the output of 
the square-law detector are, periodically, 





QP, + Pyl 
A(n/2) (20) 
or 
[Py]? 
40/2 ey 


The factor n/2 appears because the output is in either position only 
one half of the time. Since the difference in the outputs of the square- 
law detector is taken as the unbiased estimate P,, the variance of 
P, is given by the sum of expressions (20) and (21): 

_ [2P, + Py]? + [Px] 


a (22) 


or 

n Var 4 = nPx’ Var P, = 2a° + 2a + 1. (23) 
Notice that the Ryle interferometer can be considered as a Dicke’ radi- 
ometer switching between the two inputs given by expressions (16) 
and (17). Thus, (23) characterizes the accuracy of both the Ryle 


interferometer and the Dicke radiometer for measuring the spectral 
shape of the discrete radio source. Equation (238) is plotted in Fig. 5. 


IV. MEASUREMENTS BASED ON THE MAXIMUM LIKELIHOOD PRINCIPLE 
USING ONLY PHASE INFORMATION 
4.1 Measurement of Location 


If one uses the maximum likelihood principle” to process a large 
number n of independent samples of the phase difference y; in order 
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to estimate the location of the discrete radio source when 7,5 = 7/2 
or d sin 6 = )/4, one finds 


n( a4) Var 6 = —4, a? Var fy 
; (27) (24) 


al Geof 
7 (2z)” -z LD2 \a Ono r/2 ie 


where 7p, is given by (2) and the integrand of the integral in equation 
(24) is to be evaluated at n» = 7/2. For various values of ‘‘a’’, the 
definite integral appearing in (24) was evaluated numerically by using 
a digital computer and Simpson’s rule. The resulting curve is plotted 
in Fig. 3. Incidentally, this curve applies for all values of np. 

As a — 0 we find that (8) and (24) both yield 


, 2 
lim |n(22) Var i| = 2 = 0.02053. (25) 
a0 rd vis 

Thus, as a — 0, the phase principle and the maximum likelihood 
principle using only phase information are essentially equivalent. 


4.2 Measurement of Spectral Shape 


If one uses the maximum likelihood principle’ to process a large 
number n of independent samples of the phase difference 7; in order to 
estimate the signal-to-noise power ratio ‘‘a’”’ or the spectral shape of a 
discrete radio source when 6 = 0, one finds 


; T 1 Oe 2 . -1 
Vv = a — | 22] d } 26 
nm Vara , 0 De da 0 ( ) 


where 7p, is given by equation (2) with 7, = 0. For various values of 
‘‘q”’, the definite integral appearing in (26) was evaluated numerically 
by using a digital computer and Simpson’s rule. The resulting curve 
is plotted in Fig. 5. This curve also applies for all values of 15 . 

As a — 0 we find that (18) and (26) both yield 


7.00 


lim [n Var 4] = —3 = 0.81057. (27) 


a-+0 


Thus, as a — 0 the phase principle and the maximum likelihood principle 
using only phase information are essentially equivalent. 
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V. COMPARISONS OF THEORETICAL ACCURACIES 


5.1 Measurements of Location 


When using the Ryle interferometer to measure the location of a 
weak, discrete radio source located at d sin 6 = \/4, we have, from (14), 


ad\’ ] ae ee 
lim | n(22) Var 6] = Qn? = 0.02533. (28) 


a0 
Whereas, when using the phase principle or the maximum likelihood 
principle to measure the location, we have, from (25), 


lim | »(22) Var | = 2 = 0.02053. (29) 
a0 r 7 
Thus, the phase principle and the maximum likelihood principle are 
essentially equivalent, and they are both slightly more accurate than 
the Ryle interferometer. 

See Fig. 3 for a comparison of the theoretical accuracies at other 
values of “a”. 


5.2 Measurements of Spectral Shape 


When using the Ryle interferometer or the Dicke radiometer to 
measure the signal-to-noise power ratio “a” or the spectral shape of a 
weak, discrete radio source located at 6 = 0, we have, from (23), 


lim [n Var 4] = 1. (30) 
a0 
Whereas, when using the phase principle or the maximum likelihood 
principle to measure the signal-to-noise power ratio “a” or the spectral 
shape, we have, from (27), 


xe] 00 


lim [n Var 4] = -3 = 0.81057. (31) 


a-0 Tv 
Again, the phase principle and the maximum likelihood principle are 
essentially equivalent, and they are both slightly more accurate than 
either the Ryle interferometer or the Dicke radiometer. 

See Fig. 5 for a comparison of the theoretical accuracies at other 
values of “a”’. 

For values of “a” away from zero, Fig. 5 shows that the Ryle inter- 
ferometer or Dicke radiometer are more accurate than the maximum 
likelihood principle using only phase information. Thus, one must. 
conclude that the Ryle interferometer or the Dicke radiometer require 
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some amplitude information. Consequently, their accuracy is subject 
to deterioration by gain variations. 

Notice that (29) divided by (28) equals 8/m?, and (81) divided by 
(30) also equals 8/x*. Thus, for measuring the location or the spec- 
tral shape of a weak, discrete radio source, Var 6 and Var 4 associated 
with the phase principle are lower, by the same factor 8/7”, than the 
corresponding variances associated with the Ryle interferometer. 


VI. CONCLUSIONS 


For measuring the location or the spectral shape of a discrete radio 
source, the phase principle leads to a measurement which is insensitive 
to receiver gain fluctuations. 

For measuring the location or the spectral shape of a weak, discrete 
radio source, the accuracy associated with the phase principle is 
slightly better than the accuracy associated with the Ryle interferom- 
eter or the Dicke radiometer. Also, the accuracy associated with the 
phase principle is essentially equal to the accuracy associated with 
the maximum likelihood principle using only phase information. 

The phase principle is relatively simple to implement, and the im- 
plementation doesn’t require input switching. 

The calibration curve associated with the phase principle is inde- 
pendent of changes in the average receiver gain of each receiver. The 
two receivers need not have the same average gain. 

An unusually long observation time is advantageous when using the 
phase principle. 

For measuring spectral shape, both the Ryle interferometer and the 
Dicke radiometer require some amplitude information. Consequently, 
their accuracy is subject to deterioration by gain variations. 
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Some Considerations of Broadband 
Noise Performance of Optical 
Heterodyne Receivers 


By V. K. PRABHU 


(Manuscript received September 8, 1967) 


We derwe an explicit expression in this paper for the spot noise factor 
of a perfectly aligned optical heterodyne receiver consisting of a semicon- 
ductor photodiode followed by an IF amplifier. We show that this noise 
factor Fz , which is a function of the admittance of the diode, varies as a 
function of the modulation frequency. We obtain constraints tmposed by 
the photodiode on the broadband noise performance of the optical receiver 
for any arbitrary lossless interstage network. The integral form of the 
constraint shows that the noise factor Fz cannot be made equal to its optimum 
value Fro over any nonzero band of frequencies. We give explicit expressions 
for the amount of tolerance of broadband noise performance obtained with 
lossless interstage networks. We show that for certain types of approxima- 
tions, and for a certain transistor IF amplifier usually used in practice, 
the interstage network which achieves broadband signal performance for 
the receiver also obtains broadband noise performance. The theory of 
broadband noise performance we present for optical heterodyne receivers 
can also be applied to the study of broadband noise performance of other 
linear systems normally encountered in practice. 


I. INTRODUCTION 


Semiconductor photodiodes like Schottky barrier diodes or con- 
ventional p-n or p-i-n diodes are increasingly being used for detec- 
tion in optical heterodyne (double detection) receivers.1-* They nor- 
mally are fast and efficient, converting up to 70 per cent of the photons 
of the light beam into photoelectric current." Because of the intensity 
of the light beam, almost all photodiodes used in optical detection 
give an output proportional to the intensity of light.t However, this 
output is normally so small that further amplification is required, and 


429 


430 THE BELL SYSTEM TECHNICAL JOURNAL, MARCH 1968 


so any practical receiver consists of a photodiode followed by a high- 
gain low-noise IF amplifier. 

We have already considered in detail the signal performance of such 
receivers.!* In this paper we shall deal only with the noise performance 
of the receiver. The output of the diode is usually corrupted by noise 
generated within the diode and elsewhere in the system. We discuss 
briefly the characteristics of the photodiode in Section II and give its 
equivalent circuit. In Section III we discuss the noise performance of 
the IF amplifier and show that its noise factor is a function of its 
source admittance.’* We show that the noise factor Fz of the optical 
receiver is a function of frequency in spite of the fact that the IF 
amplifier has a broadband noise performance characteristic. 

In Section IV we discuss the role of the lossless interstage network 
in achieving broadband noise performance of the optical receiver and 
show that it is impossible to make the noise factor Fr equal to its 
optimum value Fro over any nonzero band of frequencies. 

Section V shows that Butterworth and Chebyshev approximations 
to Fp are realizable, and we obtain the tolerance of broadband noise 
performance for these approximations. We show that, for the photo- 
diodes normally encountered in practice, this tolerance ¢? is a monoton- 
ically increasing function of the complexity of the interstage net- 
work but decreasing for Chebyshev approximations. 

We show in Section VI that to obtain broadband signal and noise 
performance characteristics from the optical receiver two separate 
lossless interstage networks are necessary. However, we also show 
that for certain types of approximations and for a certain transistor 
IF amplifier, the interstage network which achieves broadband signal 
performance also obtains broadband noise performance for the opti- 
cal receiver. 

The theory of broadband noise performance presented in this paper 
for an optical heterodyne receiver can also be applied to obtain broad- 
band noise performance of other linear systems normally encountered 
in practice. 


II. AVAILABLE SIGNAL AND NOISE OUTPUT POWERS | 


Fig. 1 shows the optical heterodyne receiver that we discuss. It 
consists of a photodiode followed by a lossless interstage network, and 
an IF amplifier of center frequency 2, and a semibandwidth W.+ The 


; + The amplifier may, depending on the frequency of modulation, use vacuum 
tubes, transistors, masers, parametric amplifiers, tunnel diodes, or other active 
devices. 
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Fig. 1—A double detection optical receiver. The input to the photodetector 
is the sum of the local oscillator beam and the incoming signal beam. 


geometrical center frequency w, of the IF amplifier is defined as 
wo = {(@, — W)(@, + W)}™. (1) 


The available gain G, of the IF amplifier and its optimum noise factor 
F, are assumed to be independent of the frequency w forQ, — W S$ 
oSQ+ Wt 

_ In this paper we shall not consider any effects on the noise per- 
formance of the optical receiver of beam misalignment, nonuniformity 
of the surface of the diode, or distortion from transmission through 
a nonhomogeneous atmosphere.”° Because background noise has been 
shown to be of almost negligible consideration, we shall assume that 
this noise has no effect on the broadband noise performance of the 
optical receiver.?? ? 

The diode is normally so arranged that the junction or portions 
of it close to the junction are illuminated by the sum of the local 
oscillator beam and the incoming signal beam. The electron-hole 
pairs thus created by the incoming photons give rise to a small signal 
current.t In operation, a reverse bias V, is put on the diode, and the 
characteristics of the device!” for small excursions around V, are that 
of a signal current generator J,, and a direct current generator J,, in 
parallel with a capacitance C(V,), and this combination in series with 
a parasitic series resistance R(V,). 

The time-average current I, is caused both by the time-average 
illumination and by the electrons and holes that are generated at or 
near the junction. The signal current J, is caused by that portion of 
the illumination at or near the signal frequency of interest. In general, 
C(V,) and R(V,) are functions of the bias voltage. Assuming that, 
in practical cases, the excursions around the bias point are small, we 
shall henceforth assume that C'(V,) is a constant capacitance C,, and 
the series resistance R(V,) is a constant resistance R. 

t Since G, and F, are real and even functions of w, we shall only consider w > 0. 


The case in which G,; and B.; are functions of frequency w is very complicated and 
will not be discussed in this paper. 
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To account for any thermal noise generated in the photodiode, we 
use an equivalent noise-voltage generator e, with mean-squared valuet 


le, = 4kTUR Af, (2) 


where 74 is the temperature of the diode, k is Boltzmann’s constant, 
and Af is the spot frequency band about which we are concerned. In 
addition, there is another source of noise, shot noise, present in the 
photodiode. This can be accounted for?* by placing in parallel with J, 
and I, a shot noise current generator 2, with mean-squared value 


[in F = 2q7, | for) P Af, (3) 
where q is the electronic charge, and f(wr) is a transit time reduction 
factor, 7 being some effective transit time. We assume in this paper 
that f(wr) which always satisfies the inequality 


epee) |e, (4) 
can be considered independent of win, — W < w < 2, + W, and that 
| f(r) | = 1. (5) 


The equivalent circuit of the photodiode which describes its terminal 
signal and noise characteristics is shown in Fig. 2. This equivalent 


en 


R=1/G 





Y(P) 


Fig. 2— Equivalent circuit of photodiode. The physical sources of noise that are 
present in the diode are also shown. The conductance G, which appears in parallel 
with C is usually so small (around 107-7 mho) that it can be neglected for all practical 
purposes if w > 0. 


circuit very well describes the behavior of the diode provided the lowest 
frequency of the signal occurring in the system is very far from zero, 
or w > 0. 

The peak photoelectric current J, , and de current J, for a double 
detection optical receiver can be shown to be given by”’'”* 


1, = 5 VPP, (6) 


+ The horizontal] bar denotes an average. 
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and 


a9 
‘f 9 hy eae dea )s (7) 


where » is the quantum efficiency of the photodiode,t h is Planck’s 
constant, v is the optical frequency, P, is the signal power, and P, is 
the local oscillator power.t 

The available signal and noise powers are easily determined from 
Fig. 2. The signal available power S,¢ and noise available power Noa 
can be written as§ 











+ ilebet: 
Soa = 8H CPR ’ (8) 
and 
,. 72 - 12 
Nye = Hit ob Lae ® 


An important quantity that characterizes the noise performance of 
the photodiode is its signal-to-noise ratio S,~/Npa. According to equa- 
tions 2, 3, and 5 through 9, this is given by] 


Spa _ (x2) PP, (10) 
Noa hv OMT af) Af i o(22)P Ay ’ 
eure a Na hv] ° 
where 
1 
e = BG (11) 


The signal-to-noise ratio at the input to the diode will be defined 
as the best possible signal-to-noise ratio which an ideal detector could 


+ A quantum efficiency of greater than 70% has been obtained [11] for Schottky 
barrier photodiodes. 

t In practice, a fraction k < 1 of incident photons are absorbed in the active 
region of the diode. To account for this effect, J, and J, are usually multiplied by a 
factor k. We assume that k = 1. J, is also usually multiplied by a reduction factor 
similar to the shot-noise function, f(wr), determined by the signal frequency, optical 
wavelength, and device construction. We assume that this factor is unity. 

To account for any mismatch between the local oscillator beam and signal beam, 
if and J, are also multiplied by a beam matching factor 6 where 8 S 1. We assume 
that 8B = 1. 

§ We can argue from the physics of the diode that the shot noise source and 
thermal noise source shown in Fig. 2 are uncorrelated. 

{ We can assume that P, > P;,so that P, + P, =~ Po. 
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achieve. This can be shown to be given byt 


Pes. 
hy Af 





(S/N) in = (12) 


Ill. IF AMPLIFIER NOISE FACTOR 


The terminal characteristics of any IF amplifier used in the optical 
receiver (see Figs. 3 and 4) can normally be described by 


a 7 fe a & la (13) 
2 21 Yo2 Ve Nia 


SOURCE -FREE 
TWO PORT 










Fig. 3— Separation of twoport with internal noise sources into a source-free 


twoport. 
Pet] 
iy C D | —TI, nN; 


where ny and ni, or ny and n; characterize all physical sources of 
noise present in the IF amplifier.1* we assume that the IF amplifier 
has ideal broadband signal and noise performance characteristics and 
that the amplifier remains stable for all linear passive input and out- 
put terminations.?*8 

By definition, the spot noise factor at a specified frequency of any 
linear twoport network (such as an IF amplifier) is given by the ratio 
of the total output noise power per unit bandwidth exchangeablet at 
the output port to that portion of that power which is engendered by 
the input termination at the standard temperature 7,.1* To derive 


or 


} Ref. 25 shows that it is impossible to measure amplitude and phase of an 
incoming optical signal with a better signal-to-noise ratio than given by equa- 
tion 12. 

+ Exchangeable power, exchangeable gain, etc. coincide” with available power 
and available gain when the output impedance of the amplifier is positive-real. 
They are the logical generalizations of the available power and available gain 
when the output impedance has a negative-real part. Since the amplifier is 
assumed to be absolutely stable, we may substitute the word “available” for the 
word “exchangeable” wherever it appears in this paper. 
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the noise factor of the IF amplifier we are considering, let us connect 
the amplifier to a statistical source comprising an internal admittance 
Y,, and a noise current generator I, (see Fig. 5). It can be shown 
easily that the noise factor F is given by 


| 2; + MX e 
| Ins |’ 


We notice that the mean-square source noise current is related to the 
source conductance G, by the Nyquist formula 


PF=t+ (15) 








| Ing |? = 4kT.G, Af. (16) 
Also, we can express the noise voltage fluctuation | n, |’ 
an equivalent noise resistance R,, as 


[n, |? = 4kT.R, Af (17) 


in terms of 









SOURCE-FREE 
TWO PORT 






Fig. 4 — The Rothe-Dahlke noise model for a linear twoport network. 


and the noise current fluctuation | 7; |? can be expressed in terms of 
an equivalent noise conductance G, where 


|n; |? = 4kT7.G, Af. (18) 
Let us also write 
nt = 4kT .pVR,G, Af, (19) 


where p is a complex number. It can be shown that 


letet. (20) 


From equations 15 through 19, the formula for the noise factor be- 
comes§ 


G, + 2VR,G. {G, Re p — B, Im p} + (G2 + BR, 





F=1+ (21) 


As we can see from equation 21, the noise factor F is a function of 
the source conductance G, and also of its susceptance B,. We can 


§ Re a and Im a denote the real and imaginary parts, respectively, of the 
complex number a. 
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show that F attains its optimum valuef 


=1+2VR,G, {Re p + [1 — (Im p)’}'} (22) 


for a certain source admittance Y.; = G.; + 7B,; where 


VEG. 


Gop = {1 — (Im p)*}? (23) 
and 
RG. 
Bor = VEG. Im (24) 
en 











=e 
SOURCE -FREE 
Ins @an ee © ae TWO PORT 
© 


Fig. 5 — Network for noise factor computation. 


Using equations 21 through 24 (see Ref. 14), we can show that F can 
be written in the form} 


P=F, +2 /¥.— ¥eP, (25) 


where the value of R,, 1s as given in equation 17. 

Let Yo(p) be the admittance of the photodiode as seen by the IF 
amplifier (see Fig. 1). We shall now compute the over-all signal-to- 
noise ratio for the optical receiver, and its overall noise factor Fp. In 
defining Fz, we shall use the concept of noise factor as originally in- 
troduced by Friis and Franz.*+ This noise factor Fg is defined as 


(S/N)in 
(S/N)out : 
where (S/N)in and (S/N) out are the input and output exchangeable 


signal-to-noise power ratios of the optical receiver. From equations 
10, 12, and 26 we can write 


Fp = (26) 





1 It can be shown that Re p + [1—(Im p)?]1/2 is always a nonnegative quantity. 

+It can be shown (see Ref. 30) that equation 25 can also be written in the 
form (G,—Gor)2 + (B:—Bor)? = F2, where Gor, Bor, and F, can be determined 
from Equations 21-25. 
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CS/N) out 
_ oP, 1 
— WwAf ORT, 1 (2 y { aes a zl : 
yr Be ge FP, + Re y, | Ya > Yor maar ih 
(27) 
and 
1 QkT, 1 [w \ 
i abe al, (2) 
7 R,, Fees 2 T's _ re. 
{P. = Re ‘eo | Y, Yor | te ia (28) 


We now note from equations 27 and 28 that (S/N)our and Fr are 
functions of the frequency of the detected signal, and for Y2 = Yo,, 
(S/N) out, and Fz attain their optimum values (S/N)o and Fro where 


nF, 1 








S/N)o = > ;yW 29 
( / Jo Pg Eee +P tl ( ) 
qi, R \o, ° T, 
and 
1 2kT, 1 w \’ T, —T, 
aaa E +O, RB (:) (F. caesar } of) 


If optimum noise performance of the heterodyne receiver at a finite 
number of signal frequencies is desired, it can be shown’ that suitable 
lossless interstage networks can be designed so that (S/N)o in equa- 
tion 29 and Fro in Equation 30 can be realized at the respective signal 
frequencies. If the band of frequencies of interest is continuous and 
nonzero, it can be shown that we cannot make Fz equal to Fro over 
the whole band.1*}15 The question arises whether there are any con- 
straints to be satisfied by Fz, imposed by the diode or any other com- 
ponents of the system, and what lossless interstage networks must be 
used to make Fz as close to Fro as possible. We shall discuss these two 
topics in the rest of this paper. 


IV. DERIVATION OF INTEGRAL CONSTRAINTS 
We shall use the results obtained in the theory of broadband match- 
ing of linear systems’*’**-*® to derive the expressions which relate 


+ The methods of derivation of most of the results in this section are very 
similar to those in Ref. 13. 
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the noise factor of the optical heterodyne receiver to other parameters 
of the system. The integral relation shows that it is impossible to 
make F, equal to Feo over any nonzero band of frequencies. Since 
the IF amplifier is assumed to have ideal broadband signal and 
noise performance characteristics, it follows that F, and Y,, are inde- 
pendent of frequency w for2, — W S » S$ 2, + W.T It also follows 
that B,, = 0.§ Without any loss of generality we shall normalize all 
admittances with respect to G,,. 

Let us look at Fig. 6 and define two reflection coefficients p,(p) and 











LOSSLESS 
INTERSTAGE 
NETWORK 


Fig. 6— Lossless interstage network used in equalizing the noise perform- 
ance of the optical receiver. Y(p) is the admittance of the photodiode as 
shown in Fig. 2. 


p2(p), and an all-pass function 8(p) where 


_ Yilp) — Y(—p) 


ne) = FOS (32) 


B(p) = re (33) 


and p = o + yw is the complex frequency variable. Y(p) is the admit- 
tance of the photodiode as seen by the lossless interstage network, 
and a,, a, *** , @m are the poles of Y(—p) in Re p > O. Since the 
interstage network is lossless,** it can be shown that 


| prGjw) | = | po(jw) |. (34) 
Also from Equations 25, 32, and 34 
1 1 ARG os 
Tea Se eee ; 
| piljo) la | p2(jo) |? + ff Set 


~The study of the case in which F. and Yor may be functions of frequency 
is very. complicated and beyond the scope of this paper. 
§ For any realizable admittance Y.,s , Bo.s(w) must be an odd function of w. 





(35) 
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From equations 31 and 33, we can also write 


Y(p) + Y(—p). 
Vp) + ¥@) ey) 


Equation 36 shows that regardless of the lossless interstage net- 
work used in the receiver, every zero of y(p) = 4 [1+ Y(—p)/Y(p)] 
in Re p > 0 must also be a zero of 


S(p) = B(p){1 — pilp)}. (37) 


A zero p, of y(p) in Re p > 0 of multiplicity k is said to be a zero 
of transmission of the admittance Y(p) of order k. Youla distinguishes 
four kinds of such zeros.*? Class 1 contains all those in the strict right- 
half plane. Class 2 contains all those on the real-frequency axis which 
are simultaneously zeros of Y(p). Class 3 contains all those on the 
real-frequency axis for which 0 < | Y(p.) | < oo, and class 4 contains 
all those for which | Y(p,) | = 0. The restrictions imposed on 1 (p) 
through equation 36 are formulated** *>1° most compactly in terms of 
coefficients of the power series expansions of the following quantities: 


B(p){1 — pilp)} = B(p) 


s) = B@)alr) = D Sly — v0) (38) 

In s(@) = Do stp — po" (39) 
() = > Bw — po" (40) 

In 8@) = Ys lp — p.)" (41) 
Fo) = B@Y@) + Y(—pP= LM = ps 42) 
toh = Dhl — py (43) 
g(p) = oO = » 0.(p — po)’ (44) 


Also, let 7(p) be a regular all-pass network such that 


v 


PM 2 
pa ; 45 
n(p) Lata (45) 
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and 


In n(p) = > hosp (46) 


py’S are a set of points in the right half of the complex plane. 
Let 


1 ARG 
Se eS ; 
noe) =F, 


and let s,(p) be such that all its zeros and poles are in the left- half 
plane.{ It is now cleart that if 





(47) 








B(p) pi(p) = s(p) = n(p)s.(p), (48) 
Tide ace 1 _ 1 
| s(jw) |? | prio) |? | s.(ju) |? 
set nGof 
-l+ 7 a (49) 


We may now show1*?® that Y, i is a positive-real admittance if 
and only if: 


(t) At every class 1 transmission zero :, of order k, 











Se Ses Osrsk-l1; (50) 
or 
4R,G, 
b6, = exp +n — i j.n{1 + i sd 
- ad ARG of x 
e=0, if o {1 + Mabe 220, (51) 
_ a). ARN -. 
e=1, if a fy + eet 0, 
and 


=1-[imu + Babe a Larsk=—b. (2) 


TIt ete wl be recognized that so(p) is a minimum-phase function.* 

t Multiplication of p:(p) by B(p) is necessary to make it analytic in the right- 
half plane. This multiplication makes s(p) a bounded real scattering coefficient.” 
Multiplication of so(p) by (p) introduces right-half plane zeroes. This is some- 
ace) os ama and is done so that s(p) can satisfy all the constraints imposed 

y Y(p). 
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(iz) At every class 2 transmission zero jw, of order k, 


SS. = "Be.5 0<srsk-—-1, (53) 
and 


ae =U, (54) 


or 





ee jen {1 + SGer ga, OSrSk—-1, (5) 
and§ 


oa 


kW Sk 





> 0. (56) 


rat 
If | w, | = 0, or ©, equation 56 may be replaced by 


sis) 


bom +f fun {1 + 


Dirt 


=O) (57) 


(iit) At every class 3 transmission zero of order k, 


S.=B,, OSrSk-2, (58) 
and 
Suk -_ B,-1 ‘ 
SG S05 (59) 
or 
=n — [imi + Seesaw, Osrsk-1, 60) 
and 
Pea Sit > 0, (61) 
k 


with equality if and only if the matching network is nondegenerate. 
If | w, | = 0 or ~, equation 61 may be replaced by 


b,- toma tf fen {i + Saber a, 
+ _#= Fl so. 


Ht _ interstage network is nondegenerate!*-!’, equation 56 becomes an equality. 
If | Y(jwo) | ¥ ~, the network is said to be nondegenerate if and only if Yi(jwo) + 
Yj) Se If ! Yj) | = o, the network is said to be nondegenerate if and only if 
1\Jo Fo 


(62) 
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(tv) At every class 4 zero of order k, 





S,=B., O0<rSk-1, (63) 
and 
Peas 
~<a, ; 4 
S, aa B, = 4-1; (6 ) 
where a_, is the residue of Y(p) at p, = jo, ; 
or 


b= — | join {1 + ARaGar hy, 0Osrsk-—l, (65) 





F—F, 
and 
204-1 > 
b, = Sp = a- 2: (66) 
with equality if and only if the matching network is nondegenerate. If 
| w | = 0 or «, equation 66 may be replaced by 
seaiseaerear ied ue 7 Ss (67) 
bm + f juin {1 + Ger 


If equations 50 through 67 are satisfied, Yi(p) is a positive-real 
admittance. If Y;(p) is positive-real, the Darlington method can be 
used to obtain the lossless twoport interstage network needed in the 
receiver. 

Let us now use the theory of broadband noise performance that 
we have presented to derive the constraints imposed by the photodiode 
on the noise performance of the optical receiver. The normalized 
admittance Y(p) of the photodiode shown in Fig. 2 can be written ast 





sh Pa. 
Y(p) ra RG.s p + We (68) 


From equation 36 we can show that the only transmission zero of 
Y(p) lies at p, = 0, and is of order 1. Also from equation 33, 


_ P= & 
en (ee) eee (70) 
W. We 


} The equivalent circuit shown in Fig. 2 for the photodiode is valid for frequencies 
w wa 0. Also, without any loss of generality, we normalize all admittances with respect 
to Go fie 
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From equation 42 we can write 














oe 2p'C 

MO) = Goll + plas ee 
_ 1 [2Cp? — 4¢p’ me 
- Gor [ We Ww + 2) 


Since the transmission zero is of class 2, we can write from equations 
53 and 54 that 


S, = —l, (73) 
and 
S, S 2/e. , (74) 
where 
s(p) = +n(p)s.(p), (48) 
and 


1 = 1 4RAG os 
So(p)8.(—P) r—F, 
Also from equations 55 through 57 it follows that 


sy ae A.Gar | i poe fd 1\ ; 
a) Jin {ht eet a, ae: mt we i 


where ys are a set of points in the right-half plane. Since Re (1/m) 
= 0 for all l, we put 7 (p) = 1. We, therefore, have 





(47) 








ee ats 2 
= Lin {1 + Habe do <= (76) 
Also, from equations 25, 28, and 30, we can write equation 76 as 
ee Sk, RaGoy _(w/e.)? es 
=I. 4m[1+ nql, R Fr — Fro =, my) 


We must notice that R, and Gs are completely determined by the 
IF amplifier used in the system, and if we assume that the signal 
power P, remains constant at all frequencies of interest, equation 77 
shows that Fz cannot be made equal to Fre over any nonzero band 
of frequencies in spite of the fact that any arbitrary linear lossless 
interstage network may be used in the receiver. This is one of the 
important results of this paper. We must notice that the equivalent 
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circuit shown in Fig. 2 has been assumed in deriving equation 77. 
This equivalent circuit very well describes the behavior of the diode 
provided that the lowest frequency of the signal occurring in the 
system is very far from zero.1? Also we must observe from equation 
77 that Fp can be made equal to Fro at a finite number of discrete 
frequencies.?® 


V. RATIONAL FUNCTION APPROXIMATIONS 


We have shown that Fp cannot be equal to Fro over any nonzero 
intervalQ, — W Ss w S Q, + W of the frequency spectrum.t We shall, 
therefore, make some rational function approximations to a flat noise 
performance characteristic of the optical receiver. If these rational 
function approximations satisfy all the constraints of Section IV, a 
finite linear lumped lossless network can be found which realizes this 
kind of noise factor for the optical receiver.** A complete treatment 
of this problem is beyond the scope of this paper; but let us consider 
certain kinds of approximations widely used in network theory. 


5.1 Butterworth Approximations 


The problem at hand is to approximate F’, as close to Fz9 as possible 
over the rangeQ, —W Sw» SQ, + W. A set of polynomials which can 
be used for this purpose are Butterworth polynomials.**"*’ Let 


ee (r = ae be (a5e)"\ (78) 
R 0 RO ” WwW ) 
where n is the order of complexity of the interstage network to be used 
in obtaining broadband performance from the optical receiver, and 7 is 
also the order of the Butterworth polynomial. It may be verified that 
Fp approximates F'z9 in a maximally flat manner. The behavior of Fz 
as a function of w is shown in Fig. 7. Since it can be shown that Fp in 
equation 78 can be made to satisfy equations 73 through 77 by properly 
choosing ¢ for all values of n, the approximation of equation 78 is 
realizable. 
From equation 47, 


1 
s.(p)s(—p) = ———"7aR.G,i,’ (79) 


1 a 2 2\ 2n 
oa) 
20W 
{ Since the noise factor is a real and even function of w, we shall only consider the 
behavior of Fz for w 2 0 
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Fig. 7— Butterworth approximations of order n = 1,2. It is assumed that 
fe = 31.83 GHz, » = 0.70, Ta = 290°K, 7. = 500 wa, Fo = 2.0833, and & = 1. 





where 
1 
Mo= F, (80) 
It can be shown*® that 
80) = tee 
: = +o) + a,x2pW\(p +o) + + ap)’ 


(81) 
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where 


ae)? 
2 
A.) = (82) 


- 
sim s_ 


2n 


We can now expand equation 81 into a Taylor series? about p = 0. 
We have 





2Ww 
sp) = —1 ta. Srp (83) 
Now 
_ ‘ Po Mh 
1(P) UG + ui 
--yfr-p {bs bbe pall (84) 
tar (a wt) wt 
From equations 48, 74, 83, and 84, we have 
DW ween rr ts y) 
mice + Dl +4) s =. 5 
Sr a ar: Paley, ~ We 88) 


Since {(1/u:) + (1/u%)} = 0 for all J, let us put n(p) = 1. We can then 
write from equations 82 and 85 that§ 


aS 4R,G..M = 
MoM, . T 
(sin) 
A typical value of f, = w,/2m for a photodiode is about 31.83 GHz.] 
For this value of f, , f. = 2,./27 = 300 MHz, and 2W/Q, = 100 percent, 
we have plotted in Fig. 8 &;,/4R,G,,M, as a function of n. It may be 
seen from the plot that ¢2,,, is a monotonically increasing function of n. 
This behavior of ¢;, can be explained by the fact that Butterworth 
polynomials approximate the ideal broadband noise performance char- 


acteristic of the optical receiver in a maximally flat fashion’’. 
Since no useful purpose is served by using higher values of n, we 


(86) 





t We choose negative sign for s.(p) to satisfy equation 73. This does not 
entail any loss in generality.” 

§ Equation 86 can also be obtained by using equation 77. 

! Typical values of R and C for a photodiode are C = lyf, and R = 5 ohms.” 
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Fig. 8— A typical plot when Butterworth polynomials are used to approximate 
the ideal noise performance characteristic. Even though n is a discrete variable 
the plot is given for all n > 1. 


shall only consider the case n = 1.* For n = 1, 


ee (87) 
Wo Wo 
(2+ a.) 
and 
2 2 
s(—p) = s.(p) = prememey ce —— (88) 


p + 2p = + w% 
Now from equations 31, 48, and 88, it can be shown that 


1 - 
Y,(p) = RG, a (89) 
*"p 


Fig. 9 shows the circuit to realize Y,(p). Remember that w? is the geo- 
metric mean of the band of frequencies of interest, and 


1 
1 GG WiG. FW)’ _ 


t= VGuR. (91) 
This circuit agrees very well with our physical intuition. 


_*Since we are interested in minimum value of «? we haye used the equality 
sign in equation 77 
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5.2 Chebyshev Approximations 


Of the various means of approximating a given function, the Cheby- 
shev method is one of the most interesting and important. It can be 
shown that given a set of n parameters, a function f(w’) approximates 
g(w") in the Chebyshev sense if the parameters are determined in such 
a way that the largest value of | g(w’) — f(w’) | in a given interval is 
minimum.* Since for a given complexity of the structure the maximum 
amount of tolerance for a Chebyshev approximation is the same through 
the band, this type of approximation seems to be the most desirable 
in the broadband noise performance of optical receivers. 


, 
= CIRotWIIRo-W) 4, 


a Te ae 


AMPLIFIER 


L 


Fig. 9— Lossless interstage network for a Butterworth approximation of order 
n = 1. The ideal transformer ratio ¢ is given by ¢ = VRG,, - 


Let us try to approximate Fr by 


ed (7 = Ny i (tS) (92) 
Rk ~~ n RO n € n oIwW ’ 


where T,, (a) is an n™ degree Chebyshev polynominal given by** *> 37 
T(z) = cos (n cos” *x). (93) 


The behavior of Fz as a function of w for n = 1, 2 is shown in Fig. 10. 
The equiripple behavior of Fz is evident from equation 93. We can also 
show that the approximation of the type given in equation 92 can be 
made to satisfy equations 73 through 77. It can also be shown that 


_(p? +02)" + b,-o(2pW)(p? + 02)" ? + +: 





= OFF al Fai Faye? OV 
where 
aah E sine? 2 eGo | 
Gn-1 = a . (95) 


sin 1/2n 


* The Chebyshev approximating function has the equiripple property.3* 35, 37 
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-If we expand s,(p) about p = 0, we can show that 


2W 
so(p) = —1 + pan rt ee cae (96) 
We again put »(p) = 1 to obtain minimum &. From equations 74 
and 96 
4R,G.,M, 


ah a ee PG ge A Ne 
“nh? spol £ {Go }{ Sol 2 F_ 
sinh E sinh (#)(&) sin z\l 


(97) 
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Fig. 10 — Noise factor Fx as a function of w for Chebyshev approximations of 


order n = 1,2. It is assumed that f. = 31.83 GHz, 7 = 0.70, T. = 290°K, J. = 
500 wa, Fo = 2.0833, and & = 1. 
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In practical cases, assuming that w/w, K 1/V15, (wo/w.) (wo/W) sin 
a/2n “1, for w/W = +/15.t We can then write 


2 162,.G.; M o (9 8) 


€min = 2) (e): (sin x/2n) : 
™ \eo,./ \W. a /2n 
A normalized plot of &,;, for f, = 2,/2r = 300 MHz, f, = 31.83 GHz, 
and 2W/Q, = 100 percent is given in Fig. 11. We notice that 


tr Pea 
ee ae 2.0; (99) 
and 
Peciaileee me 
(eee = ry ao 1 25, (100) 
| ALLEL. 
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Fig. 11— A typical plot when Chebyshev polynomials are used to approximate 
the ideal noise performance characteristic. Even though n is a discrete variable 
the plot is given for all n > 1. 


§ It can be shown from equation 1 that w./W S Vj{5 for 2W/Q = 3. 
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From equations 99 and 100 and Fig. 11, we conclude that &,, is a 
monotonically decreasing function of n, but that no great improvement 
in the value of ¢2,;, is obtained by using very high values of n. Since the 
complexity of the network increases with n, we shall only consider the 
cases n = 1, 2. For n = 2, e2;, attains 1.25 times the minimum possible 
value. For n = 1 the Butterworth and Chebyshev approximations are 
the same. For n = 2, from equation 98 


2 2 
Emin = 2R,GMt(*) (©) ’ (101) 


Wo 
and from equations 94, 95, and 101, we can write 
2 2)2 1 2 
spa se (1 (2) opmryn" + 2) + 3(2pW)’ 








8(p) = 


Equations 31, 48, and 102 show that 











1a wt 
OS RG a 3 2 We oe 2 we 
DLP, tee ree 


(103) 


The lossless interstage network realizing Y,(p) in equation 103 is 
shown in Fig. 12. 

Similar methods can be used to determine the lossless interstage 
networks when n > 2. We have shown however that no great im- 
provement can be obtained by using very high values of n. 


5.3 Approximations with Greater than Optimum Noise Factor 


In the preceding parts of this section we used Butterworth and 
Chebyshev polynomials to approximate the ideal broadband noise 
performance characteristic of the IF amplifier in such a way that the 
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Fig. 12—Lossless interstage network for a Chebyshev approximation of order 
n = 2, The ideal transformer ratio ¢ is given by ¢ = VRG,, - 
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minimum passband noise factor is Fra. These polynomials also can 
be used?) 18 in a manner in which the minimum passband noise fac- 
tor is KF' po where 


K 21. (104) 
Such approximations are given by 
hee K(r a aE ae ey") (105) 
ee n ue 2a W , 
and 
Fp == + K(F _ aE + ene( 8), (106) 
R RO n 9 WwW 


where 7’, (x) is an n** degree Chebyshev polynomial. 
We can seen from equations 105 and 106 that 


Pe See  K Se ies. (107) 


If equations 73 through 77 are to be satisfied, it can be shown from 
equation 105 that 


ARG pM 1/2n | i] (22)(s :) 
[1 - ~7t+ oe K | ~|l-gz] Ship) simag: 208) 


Also, if Fz in equation 106 is to be realizable, it can be shown that the 
following constraint must be satisfied: 


(1 -2+ 4R,G.M./K) | 
sinh | 4 sink? S&L 


€ 


1\3 
1 (es: 
; Mh eh ( i) (2) ( wT 
— sinh i errors < oI \W sin me (109) 


In general, for arbitrary n, equations 108 and 109 can only be solved 
numerically. The numerical solution of these two equations requires 
that the value of R,G,,M, , w/w. , and w,/W be known. For any specific 
IF amplifier, the values of K and ¢’ can be determined from equations 
108 and 109 and the interstage network can then be synthesized. Since 
we do not propose to go into the characteristics of the IF amplifier, 
we shall not consider these two equations any more in this paper. 
Minimum average noise factor approximation, least-squares approxi- 
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® 3 
mation,” 


and the like also can be used in the theory of broadband 
noise performance of the optical receiver. If these approximations satisfy 
the restrictions which are imposed by the photodiode, and which are 
given in equations 73 through 77, the methods given in Section IV can 
be used to obtain a positive-real Y,(p). This Y,(p) enables us to deter- 
mine the lossless interstage network required in the broadband noise 
performance of the optical receiver. 


VI. GAIN AND NOISE FACTOR 


It has been shown for an optical heterodyne receivert? that the 
available output power P,, must satisfy the constraint given by 


a aA 2) See (w/w.)” — ae <2 (110) 














RPP, A. We 
P.. ee 
wheret 
Re Yoo 
R, = 111 
| Yo |? 
| Yr2¥or | Yr2Yor | afx2 = 
Gi, = 2 Re Yoo " meee 
, = 2 Re Yu) Re (yx) — Re (Yr2ya1) (113) 
| Yi12Ya1 | 
. 
Pee o.re(**) (114) 
G) 
2 
Yo = 3G, n.-(%2) PP, M19) 
Gams = | 2 ae 
Pia Nh oS. 


Equation 110 is identical in form to equation 77, and it can be 
shown from Ref. 13 that obtaining the broadband signal and noise 
performance characteristics of the optical receiver are analogous 
problems. It can also be shown from Ref. 13 that if Butterworth and 
Chebyshev approximations of the form given by 


K’ 
or — w?\’ 
1+( =) 


{ For an IF amplifier which is absolutely stable, it can be shown™* that A > 1. 


Ess SSP ona 0 < K’ =< 1 (117) 
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and 


K” 
Pe = Pose Te eee a re 0 < Ke < 1 (118) 


w\’ 
1+ er(4s") 


are used for the available output power of the optical receiver} reali- 
zability by lossless interstage networks requires that 





Ww 2n 
(119) 


[1 mae so + AK'RjGogGamaxl Ts [1 a Ky Ss (#-)(e) sin x. 


and 





sinh E sinh * 1 (= Kh" + 4K RG Ga max)” | 


€ 


7 Qa KY (0) 1) gy 2 
sinh | sinh = \o/\w sin 5 (120) 


We now notice that equation 119 is similar in form to equation 108 
and 120 is similar to 109. However, it can be shown that the element 
values of the lossless interstage network obtained by solving either 
equation 119 or 120 will not be identical to those obtained by solving 
either 108 or 109. This shows that the problem of broadband noise 
performance, in general, requires a network different from that re- 
quired for obtaining the broadband signal performance of the optical 
receiver, But for K = K’ = K” = 1, and for Gog = Gy, it can be 
shownt from equations 108, 109, 119, and 120 that the network which 
achieves broadband signal performance for the optical receiver also 
achieves broadband noise performance. 

For a single stage common emitter transistor IF amplifier (see 
Fig. 13), we can show that the source conductance G,,; for minimum 
noise factor is approximately equal to the source conductance G,, for 
‘maximum available gain.§ We can then say that a common emitter 
transistor IF amplifier can be used with advantage in obtaining si- 
multaneously broadband signal and noise performance from the opti- 
cal receiver. 


t+ We can compare equation 117 to equation 105 and equation 118 to equation 
6 


t We have assumed Bog = Boy = 0 for the IF amplifier. 

§ In fact it can be shown (see eo 39) that for reasonable transistor parameters 
and frequencies below (1 — ao)fa , Gos is always within a factor of /2 of the common 
emitter Go . ao is the low frequency alpha of the transistor and f, is the alpha cutoff 
frequency. 
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Fig. 13 — Simplified signal and noise equivalent circuit for the transistor. 


VII. RESULTS AND CONCLUSIONS 


A theory of obtaining broadband noise performance from an optical 
heterodyne receiver is presented in this paper. It is shown that the 
following constraint must be satisfied by any lossless interstage net- 
work used for obtaining broadband noise performance from the op- 
tical receiver: 


17° 1 SkT, R»Gss (w/w.)’ 2 
=| Jini + Sep Be ole de =, a 
This equation 77 shows that it is impossible to make Fz equal to Fro 
for any nonzero band of frequencies and for any realizable lossless 
interstage networks. 

We then consider certain types of rational function approxima- 
tions to an ideal noise performance characteristic of the optical re- 
ceiver. We show that Butterworth approximations to an ideal char- 
acteristic are realizable, but that the broadband noise performance of 
the receiver deteriorates with increasing values of n, the order of com- 
plexity of the interstage network. By approximating the ideal char- 
acteristic by Chebyshev polynomials, it can be shown that the per- 
formance improves with n, but no great improvement can be obtained 
by using very high values of n. We have shown that the performance 
for n = 2 is slightly worse than for n = oo. Realizations of networks 
for n = 1, 2 are given. 
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We also consider the problem of obtaining simultaneously both 
signal and noise broadband performance from the optical heterodyne 
receiver and show that, in general, these two problems require two 
separate lossless interstage networks. We then show that for a com- 
mon emitter transistor IF amplifier, and for certain types of Butter- 
worth and Chebyshev approximations, these two networks turn out 
to be identical. 

We give design methods and equations for any kind of rational 
function approximations to an ideal broadband noise performance 
characteristic of the optical receiver, and explicitly state the con- 
straints to be satisfied by these approximations. 

As is evident from Section IV, the theory of broadband noise per- 
formance presented in this paper for an optical heterodyne receiver 
can be applied to any other linear twoport network driven by a 
source whose internal admittance is a function of frequency. 
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Approximate and Exact Results Concerning 
Zeros of Gaussian Noise 


By A. J. RAINAL 


I. INTRODUCTION 


Let + denote the interval between two successive zeros of a station- 
ary gaussian process having zero-mean and one-sided power spectral 
density W(f). We shall refer to such an interval as a zero-crossing 
interval. This brief is concerned with these probability functions: 

(t) P(r) = Probability density of a zero-crossing interval. 

(uw) F.(r) = Probability that a zero-crossing interval lasts longer 
than r. 


Thus, F,(r) and P,(r) are related by 


eee Pur et = i Pde. (1) 


An exact, explicit solution for P(r) or F,(r) in terms of arbitrary 
W (f) is at present unknown. 

In a very interesting paper, E. Wong? presented exact, explicit 
solutions for both P,(r) and F,(r) for the special case when 


16V3/3 


VO =F + QeT +H @) 
where w = 2rf. 
The corresponding autocorrelation function p(r) is given by 
p(r) = / W(f) cos 2rfr df = ae eee Ve (3) 
0 


Wong’s exact, explicit solutions are in terms of complete elliptic 
integrals, and they stemmed from a recent result in the theory of 
Brownian motion. 

The purpose of this brief is to compare Wong’s exact results with 
the approximate results of McFadden.2, McFadden’s approximate 
results stem from the numerical solution of an integral equation, and 
they are based on the assumption of. “quasi-independence” which 
assumes that a given zero-crossing interval is statistically independent 
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of the sum of the previous (2m+2) zero-crossing intervals for all 
non-negative integral m. 

We shall see that McFadden’s approximate results compare well 
with Wong’s exact results over a significant range of r. 


II, COMPARISON OF APPROXIMATE AND EXACT RESULTS 


Figure 1 compares McFadden’s approximate result P,(7) with Wong’s 
exact result P,(r). The exact first moment of P,(r) follows from Rice’s 
work® and is indicated in Figure 1 as E(r) = 1. Thus, the approximate 
and exact results for P,(v) compare well over a significant range of 7. 

Figure 2 compares McFadden’s approximate result /,(r) with Wong’s 
exact result F’,(r). From Wong’s equation 31 we have that as r > ~, 
F(t) ~ Ce’? where C is a known constant. The semilog plot 
in Figure 2 shows this asymptotic exponential decay of F,(r). 


III. CONCLUSION 


McFadden’s approximate results P,(7), F,(r) compare well with 
Wong’s exact results P,(r), F,(r) over a significant range of r. 


&32 16V3/3" - 
me (w2+3) w2+1/3) 


= MCFADDEN'S APPROXIMATION 
= WONG'S EXACT RESULT 














Fig. 1 — Comparison of approximate and exact results for P.(r), the probability 
density of a zero-crossing interval of gaussian noise having power spectral 
density W(f). 
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_ Fig. 2— Comparison of approximate and exact results for Fo(r), the probabil- 
ity that a zero-crossing interval lasts longer than r. 
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Erratum 


On page 205 of the February 1968 Bell System Technical Journal, 
the drawings of Figs. 10 and 11 were inadvertently transposed. Fig. 
10 is the drawing with the gate electrode marked —100V, and Fig. 11 
is the drawing with the gate electrode marked +100V. 


